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Dr Kar Pearson has drawn my attention to the fact that in his recent paper 
on the P,, Test for Randomness in Sampling*, he had judged roughly in which 
direction the probability integrals should be measured, endeavouring to choose that 
direction which would lead to the more stringent result, Le. that in which the 
probability of a more improbable result occurring would be the smaller. Consider 
the simple case of an individual drawn at random from a normal parent population. 
If in the ordinary notation we found $(1—a@)=°005, we should argue that such an 
individual was very unlikely to have been drawn in a single random sampling, for it 
deviates from the mean value by about 2°58 times the standard deviation in excess, 
but actually the improbability would be as great in the case of an individual 
deviating from the mean value by 2°58 in defect, in which case the probability 
integral measured in the sume direction would be -995. Now, if we have two prob- 
ability integrals p; and p2, we may either take p; x pe or (1 —py)(1— pe) but we 
cannot take p; (1 — pe) or (1 —p1) pe, for the proof of the randomness in this test 
depends on the probability integrals being measured in the same direction. 
Accordingly in this-case p;p2 will be larger than (1—,)(1—e) if pi+pe be 
greater than unity. But when we increase the number of probability integrals 
there is no easy method of at once determining whether 


Pip2---Pn or (1 —pr) (1 — pe)... (1 — pn) 
will give the more stringent test. These remarks apply equally well when the 
frequency distribution of the parental population, true or hypothesised, is skew. 


In order to ascertain whether any strikingly different results would arise if the 
probability integrals in Pearson’s illustrations had been measured in the opposite 
sense, I have worked out the values of — logigo(1 — pi) (1 — ps) ... (1 — pn) for those 
illustrations and determined the corresponding values of Q,,=1—P,,. This seemed 
practically as short as comparing the two products 


Pipe---Pn and (1 —p,)(1 — pe) ...(1 — pn), 
for it is actually the value of J (n— 1, w) we require for relative appreciation. 


* Biometrika, Vol, xxv. pp. 379—410. 
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2 P,,, Test for Randomness 


The following Table I gives the value of Q,, for the various illustrations, and 
denotes which method leads to the more stringent results: @,, measures the 
probability of less probable cases occurring. The smaller it is, the less stringent 
the test. 











TABLE I. 
Illustration Table and Page, Value of Qa Value of Q, from B apeband 
i i n n ° 

Biometrika, Vol. xxv. hee sa, 10 a9) stringent 
(i) Table I, p. 387 "118 "343 A 
(ii) Table III, p. 390 ‘070 “O74 A 
(iii) Table IV, p. 391 *738 520 B 
(iv) Table V, p. 392 “000,008 995,351 A 
(v) Table VII, p. 396 037 242 A 
(vi) Table VIII, p. 397 "649 *888 A 
(viii) Table IX, p. 400 125 345 A 
(ix) Table XI, p. 403 048 ‘976 A 
(x) Table XII, p. 405 *436 +393 B 
(xi) Table XIII, p. 406 524 431 B 
(xii) Table XIV, p. 409 221 *653 A 
Given below} Table II below 575 388 B 























Thus in 8 out of 12 illustrations the more stringent direction of the probability 
integrals was selected by mere inspection. In the other 4 cases B ought to have 
been taken instead of A, but in none of these four cases was the difference such as 
to upset the judgment as to randomness deduced from A. The results, however, 
might modify a judgment as to one hypothesis being more or less probable than a 
second. The departure from randomness, represented by Q,,, =°5, may be in excess 
or defect and we must necessarily be as cautious of the approach of Q,, to 1 as 
to 0. Hence in cases of doubt it may be desirable to work out the values of Q,, by 
B as well as A. 


A further illustration of the method may be made here, namely to testing 
whether a series of Regression Coefficients may be taken as random samples from a 
bivariate normal population. We reproduce the theoretical discussion and its 
application to a particular example which has already been given in the Annals of 
Eugenics, as it is likely to escape many readers of Biometrika. 


he distribution of the regression coefficient R,.,, which we will write R, for 
the tth Age Group, is in samples of n, given by* 
1 - 
fa? + (R, —b)}3 


s=Z,a"%} 


where a and b are constants of the parent population, of which we are in ignorance. 


* RS, Proc. Vol. 112 A, pp. 1—14, or Tables for Statisticians, Part n, p. cli. 
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If we suppose a and b to be the same for all the sampled populations, then we have 
the combined surface in the case of k samples, if S(n,) = N, 
t=k 1 
z=2,) a’— II 
t=1 fa? + (Ry — 5} 
To determine the most probable values of the constants a and b we must make 
za maximum, or taking logarithmic differentials of z with regard to a and b: 


N-k t=k nya 





... oe a? +(R,— by? si: Gas 
t=t n4(R,—b) Bn ie ; 
t=-1 2 +(R,—bP 

Thus to determine a? and b we have the equations 

t=k ny N-k am 

aa a?+(R,—byp = eam werccccccccccccccsesesecccces (iv), 

> aglt N-k 

Ss _—_** GG ag ans nntesiconaunneennenneene : 

1=1 a? + (R, — bP? b a? (v) 


These equations appear to be only capable of solution by approximation, the chief 
difficulty being to obtain the first approximation. 
Returning to Equation (i) we see that if n, be considerable, we may replace it by 


and accordingly (ii) by 





Again tuking logarithmic differentials we have 


b= 8 — daitusthan La eameecone (viii), 
t=1 
a= =f S me ( = < eee SPE ENE oP (ix). 


Thus the first approximation to b is given by b= the weighted mean of R,’s and 
the first approximation to a? is a;* equal V/k times the weighted standard deviation 
of the R,’s. If we now put b=, + 4 and a?= a? +e’ in (iv) and (v), where ¢ and 
e;’ are small quantities, the squares and products of which may be neglected, we can 
proceed to a second approximation and if needful to higher approximations. 
Substituting these values of a and 6 in (i) we obtain by aid of the Tables of the 
Incomplete B-Function the probability integral p; of each Re. 


Illustration. Regression Coefficients of Sitting Height on Stature. As the method 
of determining whether a number of regression coefficients may be considered as 
due to random sampling from one and the same population is of interest from the 


i—2 





























IShZF- = (OG98‘THO-S ‘LZ) I -°S0S6'LI08: = 9 goes: = ty = AL 
GOLO'863-°S = 2°'BOT UA [SSL‘8Zt9: = 2? = (au) g 
9999°S98¢-1l — = “y°"So] 8980°S000-81 = 2(’ay — *a) *ugy 0169-2689 6669 
6969°S19-T gcoze. | —cF-0 + 1Z06‘1890- 6800°8000- | €8Z0- + CFOS: 191 18-29 
FSGS‘ LEST OLLIZ 8L-0 + ~0206°L890- 6F88°0Z00- | = LEO + 6188: 981 19-9¢ 
Z000°6SF- I PLL8S- 99-0 + FOLS*E990- 0069°100- | OLgo-+ ZELS 9FI | gg-ze 
98E9°ZS8-T 9ZEIL 99-0 — 6LFE6ECO- 6081°6000- £080- — 6908: | Ie | Ig-8F 
C6IF EFF I O9LLZ- 62-0 + OLIL‘ZL¢0- |. 69CeTI00- | Lego. + 6698- 961 | Leg 
GZ89'89L+s 9OL8E- 6S-0 — 9069°06F0- 6FFTT000- LOLO- — era 19 |S lOW-OP 
00L6'869:1 00008: 00-0 — ZPZ9‘6FFO- 1000°0000- 1000: — 19E8- sig | I-6¢ 
i ZE6S*LEL‘T Z96ES- 481-0 — L8ES‘Z6F0- 9LFS‘0000- =| «= FL00- — 88z8- coz | 8sé-Le 
% 9FLI‘996°1 LOSZ6- 1 — $E98°88F0- FO8Z‘6FO0- =| ZOLO-— O99L: 692 | 98-98 
= 8696°LE9-1 LPPEF- 4091-0 + 6E9T6LFO- 1#z9°0000- | «= 6L00- + I¥¥8- 08z FE-E 
PS LIGOSSL-L O6LZS- L0-0 — ZOOL 6SFO- 91T0000- =| «= F800" — 8ZE8- GO zé-18 
S SSLL*SHO-T OETPF- +9F1-0 + 6FSZTLE0- 6889°0000- | €800- + CPPS: L61 0€ 
> CFFO'SFF-I 9608: 8¢-0 + ~0206°L890- ¥969‘T100- ZFE0- + FOLS: 981 6Z 
S TEE FOL-1 IZ18¢- +908-0 — ZG9LFSSO- OOFFTO00- «| O10 — ZPCS- 881 8% 
Re ZLLO‘9ZSI ES89T- 96-0 + 6FFI'LLEO- 60890600" | ggg0- + S168: $61 LZ 
ZSFL‘OSS-T F880L: C¢-) — SIZL‘CEco- 98F9°8000- | = F620 — 8908- FE 9% 
S L6L8‘°809-1 9LZZE- 9F-0 + SIZL‘SESO- 62009000: OFZ0- LO98- FE & 
<< CELT S61 €L668- 8Z-1 — 6009 °E8F0- T918‘8800- 6190: - | PLL GLZ ¥% 
> ZESSFYS-I £6998: re-O+ | ZO9L‘TSFO- 60F€°Z000- €S10- + G1gs- cig €Z 
& 8688°886-I F9898- Z0-l — 6EF9TLFO- F8L8‘LZ00- 8ZS0- — FESL- 682 rad 
S 0926 °SFE-% ST1Z0- €0- + LSIS‘L¥FO- 18Z9°Z800- |  6060-+ ILZ6- 1z8 1z 
- LELG“6ES-1 I98LI- ¥6-0 + L6F0‘LOFO- F26SF100- =| «= Z8E0- + PELE: 88¢ 0Z 
a £669°866-1 LESS6- 81-2 — 1688°ECF0- 1802°8600- 1660- — ILEL: org 61 
Z90Z‘8S8-Z cPIZLO- 9F-1 + F616 ‘SFFO- 6FOTEFO00- Lego: + 6106- 61 8I 
1996‘T#8-1 L6¥69- 19-0 — +SZ9F ZLEO- #9¢8°C000- ZFZO- — OZT8: 88Z LI 
O98F'98S-1 16¢8¢- 62-0 + S9ZL*6ISO- 60F8°Z000- S10: + C1gs- 8&z 91 
-SFLFSIS-1 L66Z8- FF-0 + LOOL‘OFFO- #989°000- Z610- + Fees: 18 SI-€I 
S1L6°L66°1 FES66- 19% — —SPE8°C080- LOZPTFFO- 101z- — 19Z9- 66 Z1-9 
1A Fo} id *u r/o *u A 0 aay — #27) tar — Far far tu sdnorx) 
iy — ty o3y 
(xt) (ita) (11a) (tA) (A) (At) | (1) (1) 























‘aungnny uo qyhrayy buayry fo squaroifa0g uorssasbay ysa} OF, “TT ATAVL 











FLORENCE N. Davip 5 


practical standpoint, we may illustrate it on the data for the 28 Age Groups in the 
regression of Sitting Height on Stature. The necessary working is given at length 
in Table II. The numbers are here sufficiently large to justify for present purposes 
the use of a normal curve (vii) for the distribution of Rg,.s;. Column (i) indicates 
the Age Groups, (ii) the number, ;, of cases in the sample, (iii) the observed value 
of Rsz:s:, (iv) the difference of this from the weighted mean value Rgn: se, (v) the 
squares of the entries in (iv), to ascertain the weighted standard deviation of Rg,. sg, 
(vi) gives this weighted standard deviation a divided by Vn;*, (vii) gives the ratio of 
the deviation in (iv) to the standard deviation a/Vn, of Rgx:s¢ in (vi), (viii) gives 
the values of p, and finally (ix) the values of logy) p,. The sum of the values in this 
column (ix) is — 11°5865,5666 and _ since Vn logio e =V28 logig € = 2°298,0705, 
u = 5°041,8630 and J (27, w)=-42451 from the Incomplete ['-Function Table, using 
third differences. Thus the probability on our hypothesis of more divergent sets of 
samples of 28 Age Groups occurring is ‘57549 or some 58 Y of sets would exhibit 
greater divergence. As far as this test is concerned it seems not unreasonable to 
suppose that the regression coefficient of Sitting Height on Stature is constant for 
all ages. 

We may now examine this problem supposing the probability integrals measured 
in the reverse direction. 


The values of 1 — p;, and their logarithms are given in Table III below: 


TABLE IIL. 


Probability Integrals of the Regression Coefficients of Sitting Height on Stature. 




















i i 
| No. | Value of 1-p, | logyo (1 — py) No. Value of 1—p, logy (1 —p) 
| | ] ee, 
P 00466 |  3°668,3859 15 | 41879 | I1-621,9963 | 
2 “67003 1°826,0942 16 | *71904 | 1°856,7531 
Wee 61409 1-788,2320 iz 55764 1°746,3539 
| 4 -30503 1°484,3426 18 | 47210 1°674,0340 
oe “927855 1°967,4801 19 | “56553 1°752,4556 
ae “01463 2°165,2443 20 | 07493 2°874,6557 
Be: *82639 1°917,1851 21 “44038 1°643,8276 
8 *97882 1-990,7028 22 *50000 1-698,9700 
9 *13136 1°118,4631 23 “41294 1°615,8870 
10 “63307 | 1°801,4517 24 *72240 1°858,7777 
11 *10027 1-001,1710 25 “28774 1°459,0002 
12 67724 1°830,7426 26 *71226 1-852,6386 
13 29116 | 1°464,1317 27 -78230 1:893,3733 
14 ‘83147 | 1-919,8466 28 67364 1°828,4279 
Ses (PP ee eer Se eta 
: a ee — logy Ay = 12°679,3754 
| Jn logy) e= 2°298,0705 u=5°57,4005 

















is found at once from the tabled values in Comrie’s edition of Barlow's Tables, 


A 1 
J Ne 








6 P,,, Test for Randomness 


Hence P,,, = I (27, 5°517,4005) 
= ‘6124, 


from the Incomplete T-Function Table. And accordingly we may take Q,, = ‘388, 
which is somewhat more stringent than the Q,, =°575 of the A method as noted in 
Table I, but would not alter our judgment that the hypothesis that all the regres- 
sion coefficients may be taken as due to random sampling from the same normal 
population of regression coefficient Rg, - 5: = 8362 is not unreasonable. Had we asked 
whether Case No. 1 of regression coefficient Aga; 5; = "6261 was a reasonable sample 
from a population of Rg),.s; = "8362 we should have answered in the negative. This 
indicates how different our judgment may be when we are considering one or 
twenty-eight samples. Table IV below gives the value of P,, for given values of 
n, the number of probability integrals, and of —logiX,, i.e. the logarithm of the 
product of the probability integrals to base 10 with the minus sign before it. This 
table saves the trouble of computing u= —108:0%n and then interpolating into the 
Vnlogioe 
Incomplete T-Function Table. The latter must still be used for values n > 30. 


The table will suffice by using &P,, to obtain values of P,, to four figures 
except when » and — logiA, are very small, say n< 6 and — logyA, < 1°125, when 
5* P,, may be found from 8 P,, if we desire four-figure accuracy. 


The present Table IV may be used to find without much extra labour P (x, n’). 
It is true that it will only give the value of P(y?,n’) to four figures, but this is 
usually ample, and it doubles the range of the table in Tables for Statisticians, 
going up to n’=61. All we need to do is to enter the present table with 

n=4(n’—1), and — logirA, = $ x” logwe, 

and then P (8, n’) =l]-— Pigs 

For example: What is the value of P (x*, n’) for x? = 14, n’=17? 

Here n=8, —logiokn=7 x logie=7 x *434,2944 = 3:0406. 

But from our column for x= 8, we have 

—logyoAn, for 3°00, ="3875, 


—logiorn, for 3°25, ="4729, 


whence by linear difference the required P,,, 
= 3875 + 0854 x 1624 = 4014, 
and P(x’, n’) = 5986. 
Palin Elderton's table gives ‘5987, which is the value actually provided by the 
present table, if second differences are used. 


Of course the Tables of the Incomplete '-Function carry the P (x?, n’) Table to 
7 decimal places up to n’= 101, but a table to 4 decimal places up to n’=61 will 
be found not without value. 
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TABLE IV. 
Table of the P),, Function. 


















































| Sf n=2 n=3 n=4 n=5 n=6 -; n=4 (cont.) | n=5 (cont.) | n=6 (cont.) 
—] c 
aa eee aaa (EI SRE 
1 Py, &P,,, Py, SP, Py &P,,, P,,, Py ‘Py Px, &P,,, 1 Py, &P,,, Py &P,, Py, &P,,, 
re { 
| 000 | -0000 +818 | 0000 + 7] 0000 —12/ -0000 — | -0000 —I| 7-500 | -99997 — | -9998°° — | 9994 — 
| -125 | -0343 +454 2 +144] -0002 +24] -0000 + 3] -0000 —| 17-625 | -99997 — | -9999- — | -9995° — 
; -250 | -1139 +206 | 0208 +186 | 0029 +62} 0003 +13 | 0000 + 2] 7-750 | -9999§ — | -99999 — | 9996 — 
375 | -2142 +53 | -0570 +170] -0117 +91 | 0020 +30] -0003 + 7] 7-875 | -9999% — | -99997 — | -9997 — 
5 3127  -—36 | -1101 +128 | -0297 +105 | -0065+ +47 | -0012 +416] 8-000 | -99999 — | -9999 — | 9998 — 
675 | -4216 -—84]| -1760 +80] -0582 +105 | -0158 62 | 0037 +25] 8-125 | -99999 — | -99995° — | -9998 — 
| -750 | -5151 —106 | -2499 +36 | -0973 92 } -0314 +71 | 0086 +36] 8-250 | -99999 — | -9999° — | -9998° — 
| -875 | 5980 —111 | 3273 -—9]| -1456 +73 | -0540 74 | -O171 +45] 8-375 | -99999 — | -99997 — | -9999- — 
| 1-000 | -6697 -107 | -4046 -28 | -2012 +52] 0841 +71] 0301 +52] 8-500 |1-0000- — | -9999§ — | -99999 — 
| 1-125 | -7308 -—99 | -4792 —47 | -2619 28 | -1212 63 | 0483 456] 8-625 9999 — | -9999° — 
| 1-250 | -7819  —87 | -5490 —58 | -3255+ + 8 | -1647 52 | -0721 +56] 8-750 9999 — | -99994 — 
| 1-375 | -8243 -76 | -6130  -—64| -3899 -— 9] -21384 +38] -1016 +53] 8-875 9999 — | -9999° — 
1-500 | -8592 —64 | -6705+ -—66 | -4534 -—23 | -2659 +24] -1364 +48] 9-000 99999 — | -9999° — 
1-625 | -8876 -—54]|-7216 -—64]| -5145+ -—34 | -3208 +11] -1759 +40] 9-125 -9999° — | -99997 — 
1-750 | -9106 —45 | -7662 —60 | -572% —41 | -3769 — 1] -2195+ +31] 9-250 -9999° — | -99997 — 
1-875 | -9291 -—37] -8048 -—55 | -6260 -—45 | -4329 -—12]| -2662 +422] 9-375 1-0000- — | 99995 — 
2-000 | -9439  -—30]| -83879 —50 —47 | 4877. --20] -3151 +12] 9-500 9999 — 
2-125 | -9558  -—24]| -8660 -—44 —47 | -5405+ --27 | -3653 + 3] 9-625 -9999  — 
2-250 | -9652 -—19]| -8898 -—38 —45 | 5906 -32] -4157 -— 5] 9-750 9999 — 
2-375 | -9727 —16 | -9097 —33 —43 | -6376 —36 | -4657 —12 9-875 ‘9999 — 
2-500 | -9786 -—12 | -9262 -28 —40 | -6810 -36 | -5145- -—18 | 10-000 9999 — 
2-625 | -9833 -10]| -9400 -24 —36 | -7208 -37| -5614 -—23] 10-125 9999 — 
2-750 | -9870 — 8| -9513 -—20 —33 | -7569 --36 | -G061  -—26 | 10-250 1-0000-  — 
2-875 | -9898 -— 6] -9606 —-17 —29 | -7894 -35 | -6481 -—29 | 10-375 
3-000 | -9921 -—5]| -9682 -14 —26 | -8184 -—33 | -6874 —-30 
3-125 | 9989 -— 4] 9744 -12 —23 | -8441 31 | -7236 -30 
3-250 | -9952 -— 3] -9795- -10 —20 | -8667 -—28] -7567  -—30 
3-375 | 9963 — 2]| 9836 -8 —17 | -8865+ -—26 | -7869 -—29 
3-500 | -9971 -— 2] 9869 -— 6 —14 | -9087 -23] -8141 -28 
3-625 | -9978 — 1] -9895+ -— 5 | -9665+ -—12] -9186 -21 | -8385+ -27 
3-750 | -9988 — 1] -9917 -— 4] -9726 -— 9] -9314 —-18] 8603 -—25 
| 3-875 | -9987 — 1] 9934 -— 3] -9776 -—9]| 9424 -16] -8795+ —2% 
| 
| 4-000 | 9999 — 1) 9947 -—3]| 9817 -—7]| 9517 —-14] -8965- -21 
4-125 | 9992 -—1]| 9958 -— 2] 9851 -—6]| -9597 -12]|-9114 —-19 
| 4-250 | -9994 — | 9967 — 2] -9879 -— 5] 9664 -10] 9244 —-17 
| 4-375 | -g995+ — | 9974 -—1/ 9902 -— 4] 9721 - 9] 9357 -15 
| 4-500 | -9996 —/|-9979 -—1]-9921 -—3] 9769 -—8| 9454 -14 
| 4-625 | -9997 — | 9984 -—1] 9936 -— 3] 9809 —7]| 9538 —-12 
| 4-750 | -9998 — |} 9987 -1 | 99485 — 2] 9842 — 6] -9605+ —-10 
| 4-875 | -9998 —j}-9999 -— 1] -9958° -— 2] -9870 -— 4] 9672 -—9 | 
, 5-000 | -9999 — | -9992 — | 9967 —-—2 9894 -— 4] 9725- -— 8 
5-125 | -9999° — | -9994 —| 9973 -— 1] 99138 -—3]|-9770 -7 
5-250 | -99998 —j| 9995+ — | 9979 — 1] -9929 -—3]-9808 —-6 
75 | -9999% — | -9996 — | 9983 —1] 9942 -—2/ 9840 -5 | 
-9999% — | -9997 — | 9966 — 1/9052 —2/] 0867 — <A 
325 | -99997 — | -9998 —|-9989 -— 1] -9961 -— 2] -9889 -—4 
750 | -99997 — | -9998 — | -9991 —| 9968 —1]-9908 -3 
| 5-875 | -99998 — | -9999 — | -9993 —| om -1 | 9924 — 3 
| 
| 6-000 | -9999° — | -9999° — | -99945 — | 9079 - = 1) 9937  -— 2 
| 6-125 |1-0000-  — | -9999! — | -9996 — | -9983 -—.1] 9948 —-— 2 
| 6-250 99992 —|-9996° —J|-9986 -—1]|-9958 — 2 
| 6-375 -9999° — | -9997 — | -9989 —| 9965+ - 1 | 
| | 6-500 9999  — | -9998 — | -9991 —|-9971 -1 
| 6-625 -99997 — | -9998 — | 9993 — | 9977 —-1 
| 6-750 99998 — | -9999 — | 9994 —- | wet >. 9 
| 6-875 “99998 — | -9999 —j| 9995+ —/ 9984 -1 
j i 
| 7-000 -9999° — | -99992 _ | 9996 — | 9987 -1 
| 7-125 -9999° — | -99998 — | -9997 — | -9990 = 
| 7-250 1:0000- — | -9999% — | -9998 — | -9992 — 
| 7-375 ‘9999 — | -9998 — | -9993 — 
| | care 















































8 P,, Test for Randomness 
Table of the P),, Function (cont.) 

- n=7 n=8 n=9 n=10 n=11 n=12 n=13 n=14 = 

so cs 
# 2 
oe Bae hg Mike, eae eles hee | ea he Ohl Rig eg ibe Oa Pye: BP aT 
0-00 | -0000 — } 0000 — | -0000 — | -0000 — | -0000 — | -0000 — | -0000 — | -0000 — | 0-00 
0-25 | 0000 + 2 | -0000 — | -0000 — | -0000 — | -0000 — | -0000 — | -0000 — | -0000 —| 0-25 
0-50 | 0002 +17) 0000 + 4/ 0000 + 11] -0000 — | -0000 — | -0000 — | -0000 — | -0000 —j| 050 
0-75 | -0020 +55} 0004 +18] -000i1 + 4/ -0000 + 1 | -0000 — | -0000 — | -0000 — | -0000 —| 0-75 
1-00 | -0094 +109 | -0026 +47 /| -0006 +18! -0001 + 6 | -0000 2! -0000 + 1 | -0000 — | -0000 —| 100 
1-25 | -0277 +157 | -0095~ +86 | -0029 +39] -0008 +15 | 0002 + 5] -0001 + 1] -0000 + 1) -0000 —|} 1:25 
1-50 } -0618 +179 | -0250+ +124 | -0091 +69 | -00830 +33] -0009 +14/ -0003 + 5] -0001 + 1) -0000 —| 1-50 
1-75 | -1138 +169 | -0529 +147 | -0223 +99 | -0085+ +56 | -0030 +27] -0010 +12] -0003 +5] -0000 +3] 1-75 
2-00 | -1827 +132 | -0955+ +148 | -0453 +121 ; -0196 +82] -0078 +47 | -0029 +23] -0010 +10! -00083 + 4] 2-00 
2-25 | -2647 +79 | -1529 +129 | -0805+ +129 | -0389 +100 | -0173 +66] -0071 +38 | -0027 +20] -0010 +9] 2-25 
2-50 | -3546 +23 | -2232 +93 | -1287 +121 | -0682 +112 | -0334 +84] -0151 +55 | -0064 +31.) 0025+ +16] 2-50 
2-75 | -4469 -—27 | -3029 +50] -1889 +99 | -1087 +111 | -0579 +97 | -0287 +71] -0132 +46] -0057 +26] 2-75 
3-00 | -5365- —65 | -3875+ + 7/ -2590 +67 | -1603 +98 | -0921 +100 | -0493 +84] -0246 +60] -0116 +38] 3-00 
3-25 | 6196 -—89 | -4729 -31 | -3358 +32 | -2217 +76 | -1363 +95 | -0783 +89 | -0421 +72] -0212 +51] 3-25 
3-50 | -6988 -—101 | 5552 -—59 | -4157 — 3 -2907 +49] -1900 +80] -1162 +90] -0667 +80] -0360 +62] 3-50 
3-75 | -7579 -—103 | -6316 -—78 | -4953 -—33 | -3646 +18! -2517 +58 | -1631 +80] -0993 +82] -0569 +71) 3-75 
4-00 | 8117 -—98 | -7002 -—88| -5717 —-55 | -4403 -1 3192 +35 | -2179 +65] -1401 +78 | -0849 +75 | 4-00 
4-25 | -8558 -—88 | -7599 -90j; -6426 -—70| 5150+ -—33 | 3902 + 9] -2792 +45] -1886 +68 | -1204 474] 4-25 
4-50 | -8911 -—76| -8106 -—86| -7064 -—78 | -5864 -—52| -4621 -—14 | -3450+ +24 | -2489 +53] -1633 +68] 4-50 
4-75 | -9188 -—63 | -8527 -—79 | -7624 -80| -6526 -—63 | -5326 -—34 | -4132 2 | -83045- +35 | -2129 +57] 4-75 
5-00 | -9401 —51| -8870 —-—69 | -8104 -—T77 | -7125- -—71]| -5997 -—48| -4817 -—17 | -3685+ +16 | -2683 +43] 5-00 
5-25 | 9564 -—41]| -9143 -—59| -8507 -—71| -7653 -—73 | -6620 -—59 | -5485- -34| -4342 -— 2] -8278 +27] 5-25 
5-50 | -9685+ -—32 | -9357 -49 | -8839 -63 | -8108 -—68| -7184 -—65 | -6119 -—46 | -4995+ -19 | -3901 +10] 5-50 
5-75 | 9775+ -—24 | 9524 -39 | -9107 -—55 | -8495- —66 | -7683 -65 | -6707 -—54 | 5680 -—33 | -45838 — 6] 5-75 
6-00 | -9841 -18 | -9650+ -—31 | -9321 -—46| -8816 -—58| -8117  -—63 | -7241 -59| -6231 -44]| 5159 -21]| 6-00 
6-25 | 9888 -—13| 9746 -24] -9489 -—38 | -9079  -—51]| -8488 -—60| -7716 -60]_-6789 -—51| -5764 -—33)| 6-25 
6-50 | -9922 -10| -9817 -—19]| -9619 -31 | -9291 -44]| -8799  -54]| -81830 -—59 |‘7296 -—55 | -6336 -—42] 65 
6-75 | 9946 -— 8| -9869 -—14] -9719 -—24]| -9459 -35] -9056 -—48 | -8486 -—55 | -7749 -—56 | -6867 —48| 6-75 
7-00 | 9962 -— 5] 9907 -—1li| -9794 -—19] -9592 -—31] -9265+ —41 | -8788 -—51 | -8146 —54 | -7350- -—51]| 7-00 
7-25 | -9975- — 3) -9934 — 8] -9850+ -—15]| 9694 -23 | -9434 -35 | -90388 -—43 | -8488 -—51 | -7782 -—52] 7-25 
7-50 | -9983 — 2] -9954 -— 6] -9892 -—11] -9773 -19]| -9567 -—29 | -9245- -—40] -8780 -—47]| -8163 -—50] 7-50 
7-75 | 9988 - 1] -9968 —.4] -9923 — 8] -9833 -—15] -9672 -—23] -9412 -—33 | -9025- -—42] -8498 -—48]| 7-75 
8-00 | 9992. — 1] -9978 -— 3] -9945+ — 6 | -9878 -—12] -9754 -19]| -9546 -28] -9228 -37] -8776 -44]| 8-00 
8-25 | -9995- — | -9985- — 2] 9961 -— 5| -9911 -— 8] -9816 —15 | -9653 -23] -9393 -—32]| -9015- -—40| 8-25 
8:50 | -9997 —|-99909 -—2]| 9973 -— 4] -9936 -— 7] 9864 -—12]| -9736 -—19]| -9598 -27! -9214 -35| 8-50 
8-75 | -9998 — | 9993, — 1] 9981 -— 3| 9954 -—5| 9900 -— 9 | -9801 —15 | -9635+ —22 | -9378 -30| 8-75 
9-00 | -9998 — | 9995+ —)| -9987 — 2] 9967 -— 4] 9927 -— 7] 9851 -—12]| -9720 -18]| -9512 -26] 9-00 
9-25 | -9999-  — | -9997 —/}-9991 -—1] 9977 -—3]| 9947 -— 5] -9889 -— 9] -9787 -15] 9620 -22] 9-25 
9-50 | -99999  — | -9998 —| 9994 -— 1] -9984 - 2| -9962 -— 4] -9918 -— 7] -9839 -12] -9706 -18]| 9-50 
9-75 | -9999° — | -9999- at ens — 1] 9989. — 2] -9973 -—3] 9940 -— 5] -9879 -—9] -9774 —-14] 975 
10-00 | -99997  —J| -99999 — | -9997 —j|-9992 -—1] -9981 -— 2] 9956 -—3]|-9910 -— 7] -9828 -—12] 10-00 
10-25 | -9999° — | 99991 — | -9998 —|-9994 -— 1] 9986 -— 1] -9968 -— 2] -99883 -— 5] -9870 -—10]| 10-25 
10-50 | -99999 — | -9999° —| -9999- — | -9996 —j}-99909 -—1] 9977 -— 2] 9951 -— 4] -9902 — 8 {| 10-50 
10-75 | -99999  —| -99997 — | -99991 9997 —|-9993 — 1] -99838 -— 1] -9964 -— 3] 9927 -— 7] 10-75 
11-00 |1-0000- —/ 99998 — | .99994 — | -9998 — | -9995+ —]| -9988 -— 1] -9973 — 2] -9945+ -— 5] 11-00 
11-25 9999  —| -9999° —|-9999- — | -9997 —j|-9992 — 1] -9981 -—1] -9960 — 4] 11-25 
11-50 99999 a -99997 — | -99997 — | -9998 — | -9994 — | -9985+ — 1] -9970 — 3] 11-50 
11-75 1-0000- - | 9999 —J| -9999' — | -9998 — | -9996 —|-99909 -—1] -9978 — 2] 11-75 

| 

12-00 | 99999  —/| -9999° —]| -9999- — | -9997 — | -9993 — | -9984 — 1] 12-00 
12-25 | -99999  — | -99997 — | -99999  — | -9998 —|-9995- —]| -9989 — 1] 12-25 
12-50 | 9999  — | -99998 —| -99995 — | -9999- — | -9996 —j| 9992 -— 1] 1250 
12-75 11-0000- — | -99999 —]| -99997 — =] -99999 — | -9997 — | 9994 -— 1] 12-75 
13-00 99999 99998 —| -9999° — | -9998 — | -9996 — | 13-00 
13-25 1-0000- — | -9999° —]| -99995  —j| -9999- — | -9997 — | 13-25 
13-50 9999 —/ -99997 — | -99991 — | -9998 — | 13-50 
13-75 99999  — | -99995 —/| -99994 — | -9998 — | 13-75 
14-00 1-0000- — | -99999 —j| -9999° — | -9999- — | 14-00 
14-25 9999 —| -99997  — | -99992 — | 14-25 
14-50 99999 —/| 99995 — | -99995 — | 14-50 
14-75 i-0000-  —/| -99999 — | -g999¢ — | 14-75 
15-00 9999 — | -99997 — | 15-00 
15-25 9999 — | -99998 — | 15:25 
15-50 100007 —/} -99999  — | 15-50 
15-75 ‘9999 = — | 15:75 
16-00 99999 —| 16-00 
16-25 1.0000" — | 16:25 
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FLORENCE N. Davip 9 
Table of the P),, Function (cont.) 

n=15 n=16 n=17 n=18 n=19 n=20 n=21 n=22 <= 

—J 

= 

Py OP Py OPS ig: OR igh She tg) tag ea Py, CF,| 2 CR ey Sei a 
-0000 — | -0000 — | -0000 — | -0000 — | -0000 — | -0000 — | -0000 — | -0000 _ 1-50 
‘0000 =6+ 1) -0000 — | -0000 — | -0000 — | -0000 — | -0000 — | 0000 — | -0000 _— 1-75 
0000 + 1) 0000 + 1! -0000 — | -0000 — | -0000 — | -0000 — | -0000 — | -0000 _ 2-00 
0003 «+ 4 -0001 + 1] 0000 + 1) -0000 — | -0000 — | -0000 — | -0000 — | -0000 _ 2-25 
0009 + 8} -0003 + 4! -0001 + 1) 0000 + 1 | -0000 — | -0000 — | -0000 — | -0000 _ 2-50 
“0022 +14} 0009 + 7] 0003 + 3] -0001 + 11] -0000 + 1] -0000 — | -0000 — | -0000 _ 2-75 
0051 +22 | -0021 +12] -0008 + 6] 0003 + 3/ 0001 + 1] -0000 + 1! -0000 — | -0000 —_— 3-00 
0101 =+32 | -0045+ +19] -0019 +10) -0008 +4 | 0003 + 2) -0001 + 1) -0000 +1 | -0000 — 3-25 
0183. +43 | -0088 +27 | 0040 416)/ 0017 + 8} OT + 4/ 0003 + 2/ 0001 +1) 0000 +1 3-50 
0308 §=+54 | -0158 +37 | -0076 +23 | -0035+ +13 0015+ + 8 | 0007 + 3) 0003 + 1) -0001 os 3-75 
0487 +62 | -0264 +446 | -0136 +31 0067 +20 | -0031 +11 | 0014 + 6) 0006 + 3) 0002 +2 4-00 
0728 +68 | -0417 +55 | -0227 +40 | -O117) +27) -0058 +16 | -0027 +10] 0012 + 6/ -0005+ + 3 4-25 
1037 =9+71 | -0625- +61 | -0358 +48 | -0195+ +35 | -0101 +24 | -0050+ +14 | -0024 4 8/0011 4 4 4-50 
71414 +69 | -0893 +64] -0537 +54] -0307 +42] -0168 +29] -0088 +2 0044 +12) 0021 +7) 475 
-1858 +66 1225+ +631] -0770 +59 | 0461 +49 | -0264 +37 | -0145- +26 | -0076 +417 | 00388 +10 5-00 
-2360 +59 1621 +59 1062 +60 | -0664 +853 “0397 +44 | -0227 +32) -0125- +22 | 0066 +14 5-25 
-2911 +48 | -2075- +51] -1413 +57 | -0920 +56 ! 0574 +47 | -0342 +38] -0196 +428] -0108 +19 5-50 
| 3496 +34 | -2580 -40 | -1822 +52 | -1232 +55 | -O798 +52 | -0496 +44 | -0295+ +34/ 0169 4425 5-75 
4100 +20) -3125+ +28 2283 +44] -1599 +52) -1074 +52 | -0692 +47] -04298 +39 / -0255- +30 6-00 
4709 «+ 4/ 3698 +14/ -2788 +34] -2018 +46 | -1402 +52 | -0936 +49] -0601 443] 0871 +35] 6-25 
5309 —21 | -4285 - | 3327 + 22 | -2483 +38 | -1782 +46 | -1229 +49 | 0816 +46 | -0521 +439 6-50 
| 5888 —32 | -4873 -—12]| -3887 +10 | -2986 28 | -2208 +41 | -1572 +47] -1077 +47 7ll +43) 6-75 
6434 -40/| 5449 -23 | -4458 —-— 2 | -3517 +20) 2675+ +33 | -1961 +42 | -1385+ +46 | -0944 +44 7-06 
| ‘6940 -—45/ -6003 —-31 5025+ -—13 | -4065+ + 6 | *3175- +22 | -2392 +36) -1739 +43) -1220 +44 7-25 
‘7402 —48 | -6526 -—38 | -5580 -—23 1619 - 5 | -3697 +14] -2859 +28 | -2136 +38] -1541 +438 7-50 
‘7816 -—48 | -7010 —42] -6111 —31 5168 —-—15 | 4233 + 2 | 3354 +19] -2570 +31] -1904 +39 7-75 
8182 -—47 | -7452 -—45 | -6612 -—36 9702 —- 2% A771 — 7/ 3868 + 9] 30386 +23] 2306 +34 8-00 
-8501 —45 | -7850- —45 JOTT —40 | -6213  -—30 5302 —16 | -4391 — | 3525+ +15 | -2742 +27 8-25 
‘8775- —41 | -8202 -—44 7501 —42 | -6694 -—35 | -5817 +23 | 4914 — 9/| -4029 + 6 | -3205+ +20 8-50 
9007 -—37 | -8510 -—42/] -7883 -—43]| -7141 -—38 | -6309 -29]| -5428 -17] -4540 — 2] -3688 +12] 8-75 
‘9203 -—33 | -8776 -—39 8223 —42 7549 —40 | -6772 -—33 | 5926 -24/] 5049 -10/ -41838 4 4 9-00 
‘9364 -—29 | -9002 -—35] -8520 -40/] -7916 -40 | 7202 -—37 | -6400 -27]| 5547 -17)| 4682 -4 9-25 
9498 -—25 | -9193 -—31]| -8778 -—37 | -8244 -39]| -7594 -38]| -6847 -—33] -6029 -23] 5177 -11] 9:50 
9606  -—21 | -9353 -—28 | -9000 -—34 | -8532 -—38/| -7949 -—38] -7260 -35 | -6487 -28] 5661 —-—17] 9-75 
9693 —18 | -9485+ -—24]| -9187 -30]! -8783 -—35 | -8266 -—37 | -76389 -—36] -6918 -—31 | -6127  -23 | 10-00 
9762 -15 | -9593 -—20] -9344 -27]| -8998 -—32/ -8545- -—35] -7982 -36 1 -7317 -—33 | -6570 —27 | 10-25 
‘9817 —12 | -9681 —17 | -9475- -—23 | -9182 -—29) -8789 -34/]| -8288 -36/! -7682 -—-: “6986 -—30 | 10-50 
‘9860 —10)| -9751 —14 | -9583 -—20/| -9337 -—26 | -8999 -—30/ -8558 -—34/ -8014  -—35 | -7871 —32 | 10-75 
‘9894 — 8 | -9808 —12 | 9670 -—17 | 9466 -—22/ -9179 -—28 | -8795+ -—32) -8310 -—34 | -7725- -—33 | 11-00 
‘9920 — 6] -9852 -—10)] -9741 —14 | 9573 -—19 | -9331 —24 | -9001 -—29 | -8573 -—32 | -8045+ -—33 | 11-25 
‘9940 — 5] 9887 -—8]|-9798 —12] -9661 7 | 9459 -22)/ -9177 -—26 | -8803 -—30)] -8332  -—32) 11:50 
9955+ — 4] -9914 -— 6] 9844  -10]| -9732 1 -9565- —19 | -9326  -—23 | -90038 -—28 | -8587 —-—31 | 11-75 
‘9967 — 3] -9935- -— 5] -9880 -— 8] 9790 — 9652 -15 | -9452 -21] 9176 -—25/ 8812 —-2 12-00 
‘9976 — 2] 9951 -— 4] -9908 — 6 | -9836 9 | 9724 -—14] 9558 -—18 | -9323 -23 | -9007 -—27 | 12-25 
‘9982 — 2] 9963 -— 3] 9930 -— 5] -9873 —- ‘9782 —-11 9645- —15 | 9447 -—20/ 9176 -—24]/ 12-50 
9987 — 1] -9973 — 2] 9947 -— 4] 9902 -— 6] -9829 -10/ 9717 -—13 | -9551 -—17 | -9320 -—322 | 12-7 
9990 — 1] -9980 — 2] -9960 — 3] -9925+ — 5/| -9867 -— 9] 9775+ -—11 | -9638 -—15 | -9443 —-19 | 13-00 
9993 — 1] -9985+ — 1] -9970 —2]|-9943 -— 2; 9897 -— 7] 9823 -—9]| 9710 —-—13] 9546 -—17 | 13-25 
9995+ —/ -9989 -— 1] 9978 -— 2] -9957 -— 3] -9920 -— 5] 9861 -— 8] 9769 —11 | -9682 —-—15 | 13-50 
-9996 — | -9992 -— 1] 9983 -— 1] 9967 — 3] 9988 -— 4] 9892 -— 6] 9817 -—9]| 9704 —-13] 13-75 
-9997 — | -9994 — | -9988 -— 1] -9975+ — 2] 9953 -— 3] 9916 -—5]| -9856 -— 8] 9763 —-11 | 14-00 
9998 — | -9996 — |} -9991 -— 1] 9982 -— 2] 9964 — 2] -9935+ — 4] -9887 -— 6] -9811 9 | 14-25 
9999- — | -9997 — | -9993 — | -9986 — 1] 9973 — 2] -9950+ 3 | 9912 -— 5] 9850+ -— 8 | 14:50 
99991 — | -9998 —/|-9995- —]-99909 -— 1] -9980 — 2] -9961 3] -99381 — 4/] 9882 — 6] 14-75 
99998 — | -9998 — | -9996 — |} -9992 — 1] -9985- — 1] 9971 -— 3] 9947 -—3] 9908 —-— 5] 15-00 
“99995 — | -9999— — | -9997 — | -9994 — | -9988 -—1]-9978 -— 2] - — 3) 9928 -— 4] 15-25 
-99997 — | -9999° — | -9998 — | -9996 —/|-9991 -—1)/ 9984 -—2/] 9968 -— 2] 9944 —-— 3] 15:50 
“99998 — | -9999¢ — | -9999- — | 9987 — |} -9994 —1/ 9988 -—1]-9976 -—2/| 9957 -— 3] 15-75 
“99998 — | -9999° — | -9999° — | -9998 — | -9995- —/}-9991 -—1|-9982 — 1) 9967 — 2] 1600 
-9999° — | -99997 — | -99998 — | -9998 — | -9997 — |} -9993 -— 1] -9985+ — 1] 9974 —-— 2) 16-25 
*9999° — | -9999° — | -99995 — | -9999- — | -9998 — | -9995- — |} -9999 -—1)/ -9981 — 1] 16-50 
“9999° — | -9999° — | -99997 — | -9999" — | -9998 — } -9996 — | -9992 — | 9985+ -— 1 | 16-75 
10000" — | -9999° — | -99998 — | -99994 — | -9999- — | -9997 — | -9994 — | -9989 -— 1) 17-00 
“99999 — | -99998 — | -9999° — | -9999° — | 9998 — | -9996 — | -9991 — 1] 17-25 





































































































10 Py, Test for Randomness 
Table of the P),, Function (cont.) 

Sf | 2=15 (cont.) | n=16 (cont.) | n=17 (cont.) | n=18 (cont.) | n=19 (cont.) | n=20 (cont.) | n=21 (cont.) | n=22 (cont.) P 
: = : 
| Pr, PPay Prn PP), Prn FPA, P An &P. An Py, PP a, Pr, Pr, Prr FPP), Prn ePPa,, + 

17-25 -9999° — | -99998 — | -9999° — | -9999° — | -9998 — | 9996 —/} 9991 — 1] 17-25 
17-50 1-0000- — | -9999° — | -99997 — | -9999 — | -9998 — | -9997 — | -9994 — | 17-50 
17-75 -9999° — | -99998 — | -99995 — | -9999- — | -9998 — | 9995+ —} 17-75 
18-00 “99999 — | -99998 — | -9999° — | -9999? — | -9998 — | -9996 — | 18-00 
18 25 1-0000- — | -9999° — | -99997 — | -99994 — | -9999- — | -9997 — | 18-25 
18-50 99999 — | -99998 — | -9999° — | -99991 — | -9998 18-50 
18-75 “99999 — | -9999° — | -99997 — | -99998 — | 9999" —/| 18-75 
19-00 1-0000- — | -9999° — | -9999° — | -9999° — | -9999- — | 19-00 
19-25 -9999° — | -99998 — | -9999° — | -9999? — | 19-25 
19-50 1-0000- — | -9999° — | -99997 — | -99994 — | 19-50 
19-75 -9999° — | -99998 — | -9999° — | 19-75 
20-00 -9999° — | -9999° — | -99997 — | 20-00 
20-25 1-0000— — | -9999° -9999* — | 20-25 
20-50 -y999" — | -99998 — | 20-50 
20-75 1-0000— — | -9999° — | 20-75 
21-00 -9999° — | 21-00 
21-25 -9999° — | 21-25 
21-50 /1-0000-  — | 21-50 
PSS os SE x dep Dees ae 
T ae ain ] 

S n =23 n=24 n=25 n=26 n=27 n=28 n=29 n =30 < 
— = 
ag < Sea cy ak a Loe eek ets ee ae ee 
| Pr, Py, Prp BP an Pan FP, EX FP), Pry dP, Pr» SP, Pry Py, Prn ePr, | 

ae 

3-50 | -0000 — | -0000 — | -0000 — | -0000 — | -0000 — | -0000 — | -0000 — | -0000 — 3-50 

3-75 | 0000 + 1 | -0000 — | -0000 — | -0000 — | -0000 — | -0000 — | -0000 — | 0000 —| 3-75 

4-00 | -0001 — | 0000 + 1 | -0000 — | ‘0000 — | -0000 — | -0000 —- | -0000 — | -0000 — 4-00 

4:25 | 0002 + 2) -0001 — | 0000 + 11 -0000 — | -0000 — | -0000 — | -0000 — | -0000 — 4-25 

4-50 | 0005+ + 2] 0002 + 1] -0001 — | 0000 + 1 | -0000 — | -0000 — | 0000 — | -0000 _ 4-50 

4-75 | 0010 + 3) 0004 + 2/ 0002 + 11] -0001 — , 0000 +1 0000 — | -0000 — | -0000 — 4-75 

5-00 | 0018 + 7/{| 0009 + 3] 0004 + 2/ 0002 + 1 0001 —/} 0000 +1 0000 — | -0000 ~- 5-00 

5-25 | 00833 + 9 | 0016 + 6] 0008 + 2] -0004 — | -0002 — | -0001 ‘0000 + 1 | -0000 _ 5-25 

5-50 | 0057 +12) 0029 +9] 0014 + 5} -0006 + 4] 00038 + 2] -0001 + 11] -0001 — | -0000 + I 5-50 

5-75 | 0093 +17 | -0049 tll 0025+ + 7 0012 + 4| 0006 + 2] 00038 + 2 0001 + 1 | -0001 —_— 5-75 

6-00 | 0146 +21 | 0080 +15 | -0043 +10 | 0022 + 5} -OOl1 + 4] 0005+ + 2] 0002 + 2/ -0001 + 1 6-00 

6-25 | -0220 +26 | -0126 +18 0070 +13 0037 + 8 | 0019 + 5 | -0010 +3 0005- + 2 | “0002 +1 6-25 

6-50 | -0321 +31 | -0190 +24 0109 —- ++-:16 ‘0060 +11 0032 + 7 | 0017 + 4 0008 + 3 | -0004 + 2 6-50 

6-75 | 0453 +36 | -0278 +27 0165- +20 | -0094 +14 0052 + 9 | -0028 + 6] 0014 + 4/ 0008 + 2 6-75 

7-00 | -0620 +39 | -0393 32 | 0241 +25) 0142 +18] 0082 +12] -0045+ + 8] 0024 +5] 00183 +3 7°00 

7-25 | 0826 +41 | 0540 +36 | 0341 +29 | 0209 +22] 0123 +16] 0071 +11] 008389 + 7] 0021 + 4] 7:25 

750 | -1074 +42 | -0723 +38 | -0471 +33 | -0297 +26 | -0181 +19 | -0107 +14 | 0061 + 9 | -0084 + 6 7-50 

775 | -1364 +41 | -0945- +40] -0633 +36) -0411 +29 | -0258 +23 | -0157 +17 | -0093 +12 | -0053 + 8 7-75 

8-00 | -1695+ +39 | -1206 +40 | -0831 +38 | -0554 +33 | -0358 +27 | 0224 +21 0136 +15 | 0080 +11 8-00 

8-25 | -2066 +35 | -1508 +39 | -1066 +38 | 0730 +35 | -0485- +30 | -0312 +24] -0195- +18) -0118 +13 8-25 

8-50 | -2472 +30] -1848 +36] -1340 +38) 0941 +37 | 0641 +33 | -0424 +27] 0272 +22) -0170 +16] 8-50 

8-75 | -29008 +24 | -2225- +32 | -1651 +36 | -1189 +437] 0831 +34] -0563 +30) 0371 +25 | 0237 +19] 8-75 

9-00 | -3368 +17 | -2633 +27] -1999 +33) -1473 +36] -1054 +35 | -0733 +32] -0495- +28 | 0324 +422] 9-00 

9-25 | -3844 + 9] 3069 +21] -2380 +29] -1/94 +34] -1313 +35 | -0934 -34 | 0646 +30 | 0434 425] 9-25 

9-50 | -4340 + 1] 3524 +14] -2790 +24] -2148 +30] -1607 +34] -1169 +34] -0827 +32) -0569 +28 9-50 

9-75 | -4817 — 5 | -3994 + 7 | 3224 +18 | -2532 +26 | -1935- +31 | -14388 +33] -1040 +33 | -0732 +30 9-75 

10-00 | -5299 -12 | -4470 — | -3675- +11] -2942 +21 | -2294 +28) -1741 +32] -1286 +32 | -0925- +31 | 10-00 
10-25 | 5768 -18 | 4946 — 7] -4137 + 5 | 3373 +15 | -2680 +23 | -2074 +29] -1564 +31) -1148 +831 | 10-25 
10-50 | -6220 -—23 | -5415- -13 | -4604 -— 2| 3819 + 9] 3090 +18 | 24837 +25] -1873 +29] -1403 +31 | 10-50 
10-75 | -6648 —27 | -5871 -—18/ 5069 — 8| 4274 + 3] 3518 +12 | -2825- +21] -2212 427 /| -1689 +30] 10-75 
11-00 | -7051  -—29 | -6309 -—22 | 5525+ -14 | -4732 -— 3] 3959 + 7] 8234 +16 | -2578 +423 | -2005- +27] 11-00 
11-25 | -7424 -—31 | 6724 -—26/| 5969 -18! 5186 -— 9) -4406 + 1] 3658 +10| 2966 +19 | -2348 +24 | 11-25 
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Table of the P),, Function (cont.) 
a = n =23 (cont.) | n=24 (cont.) | n=25 (cont.) | n=26 (cont.) n=27 (cont.) | n=28 (cont.) | n=29 (cont.) n=30 (cont.) <= 
cs 2 . —7 
s # ie. s 
| + Sa &P,,, Pr, &P,,, op te SP), Be wee | FP, bo &P),, Ae &P,,, Py, &P,,, i 
| my 
17-25 11-25 | -7424 -31 | -6724 -26| -5969 -18] -5186 — 9 | -4406 + 1) 3658 +10 | -2966 +19 | -2348 +24 | 11-25 
17-50 11-50 | -7766 -—32] -7113 —28 | -6393 -22] -5631 —-—14] -4g55- — 5 4093. + 5 | -3373 +14 | -2715+ +21 | 11-50 
17-75 11-75 | -8076 -32 | -7474  -—30] -6796 -26 | -6062 —18 | -5298 —10 | -4533 — 1/| -3793 + 9) -3103 +16 | 11-75 
18-00 12-00 | -8355- -—31 | -7806  -30] -7173 —-—27 | -6475- —22 d732,  —14 | -4972  — 6 | -4223 + °3 | -3508 +12] 12-00 
18-25 12-25 | -8603 -3 8106 = —30 | -7523 —29 | -6865+ —25 | -6152 -18 | 5406 —11 | -4655- — 2 3924. + 7] 12:25 
18-50 12-50 | -8821 -—28 | -8377 -—29| -7844 -29 | -7281 -—27 | -6553 —22 5829. —15 | -5086 — 7] -4348 + 2] 12-50 
18-75 12-75 | 9012-26 | -8618 —28 | -8136 -29 | -7571 —28 | -6932 -24 | 6238 -18 | 5510 —11| -4773  — 3 | 12-75 
19-00 13-00 | -9177 -—23 | -8831 -—27 | -8899 -—28 | -7882 —28 7287 —26 | -6628 -—21/ -5922  -15]| -5195- — 8 | 13-00 
| 19-25 13-25 | -9319 -21] -9017 -—25 | -8634 - —27 | -8166 —28 “7617 —27 | -6996 -—24 | -6320 -18 | 5609 —12 | 13-25 
19-50 13:50 | -9439 -—19] 9179 —225| -8841 —25 | -8421 -27] -7919 —27 | -7341 —-25 | -6700 —21 | -6012 —15 | 13-50 
19-75 13-75 | 9541 —16 | 9318 —20 | -9023 -24 | -8649 -26 | -8195- —27 | -7661 —26 | -7058 —23 | -6400 —18 | 13-75 
20-00 14-00 | 9627 —15 | 9437 -—18 | -9181 —22 | -8852 -24 | -8443 —26 | -7955+ —926 | -7393 —94 | -6769 —21 | 14-00 
20-25 14-25 | -9698 —12 | -9538  -86 | -9318 —20| -9029 -23 | -8665+ —25 | -8223 -26 | -7704 —25|-71I8 —323 | 14-25 
20-50 14-50 | 9757 —10 | -9623 —14 | -9435- -17 | -9184 —21 | -8863 -24 | -8465- —25 | -7990 —25 | -7 —24 | 14-50 
| 20-75 14-75 | 9806 — 9] 9694 —I12 | -9535- —15 | -9319 -19 | -9036 -—22 | -8681 —24 | -8251 -—25 | -7746 —24 | 14-75 
21-00 15-00 | 9846 — 8} -9753 -11 | -9619 —14 | -9434 -17]| -9188 -20] -8874 —23| -s486 —24| -8024 —24 | 15-00 
| 21-25 15-25 | -9878 -— 7] -9801 — 8] -9689 -—12]| -9582 -—15 | -9320 —18 ‘9044 -—21 | 8697 -—23] -8278 -—24] 15-25 
21-50 15-50 | -9 — 6] 9841 -— 7] 9748 -10| -9615+ -—13 | -9433 -16]| -9192 —19 | -88g5+ —22 “8507 —23 | 15-50 
15-75 | 9924 -— 5] -9874 -—6]| -9797 -— 9] -9686 -—11]| -9530 —15 9322 -—18 | -9051 -—20] -8713 -22 | 15-75 
16-00 | -9941 — 4] -9900 - 5] -98387 -— 7] 9744  -10] -9613 -—13| -9433 —16] -9197 —19 | -8897  —21 | 16-00 
16-25 | 9954 -— 3] 9921 -— 4] -9870 -— 6| 9793 - 8] -9682 -11 | -95299 -14| -9324 —17]| -9059 —320| 16-25 
16-50 | 9964 — 2] 9938 -— 3] -9897 —-— 5] -9833 -— 7] 9741 -10] -9610 —12] 9434 —15 | 9202 —18 | 1650 
16-75 | 9972 — 1] 9952 - 3] 9918 — 4] -9866 - 6| 9789 —- 8| -9679 -11| -9598 —14| 9326 —17 | 16-75 
17-00 | 9979 — 1) -9963 — 3] 9936 - 3] -9893 -— 5 | -9830 - 7] -9737 —9]| -9608 —12| 9434 —15 | 17-00 
17-25 | 9984 — 1] -9971 -— 2] -9949 -— 3] -9915 -— 4] -9863 — 6 | 9786 — 8 | 9677 -11 | 9527 —13 | 17-25 
= | 
? 17-50 | 9988 -—1)/+9977 -— 2] 9961 — 2] -9933 - 3] 9899 -— 5] -9826 — 7] -9735- —9| 9607 -12] 17-50 
2 17-75 | 9991 — 1] -9983 — 2] 9969 -— 2] -9947 -— 3] 9912 -— 4] -9860 — 6] -9783 — 8| -9675- —10 | 17-75 
7 18-00 | -9993 — |} ‘9987 -— 1] 9976 — 2] -9959 - 3] -9930 -— 3] -9887 -—5]| -9823 -— 7] -9732 — 9] 18-00 
| 18-25 | -9994 — | 9990 -— 1] -9982 -— 1] -9967 — 2] -9945+ -— 3] -9910 -— 4] -9857 — 6] -9780 — 8 | 18-25 
— | 18-50 | -9996 — | -9992 — | 9986 -— 1] 9974 -— 2] -9957 - 2] -9928 -— 3] -9885- — 5] -9821 -— 7] 18-50 
50 ' 18-75 | -9997 — | -9994 — | -9989 — | 9980 -— 1] 9966 - 2] 9943 -— 3] -9907 - 4] 9854 — 6] 18-75 
3:50 19-00 | -9998 — | -9996 — | -9992 — | -9985- -— 1] 9973 —-1 | 9955- — 3 | -9926 -— 3] -9882 — 5] 19-00 
oa 19-25 | -9998 — | -9997 — | -9994 — | -9988 -—1]|-9979 -1 | 9964 — 2/| 9941 -— 3] -9905- —- 4] 19:25 
425 19-50 | -9999- —| 9098 —| 9995+ —| 9991 —1| 9984 — 1] 9972 —2 | 9953 — 2| 9924 —- 3| 1950 
4:50 | 19-75 | -9999° — | -9998 — | 9996 — | -9994 — | -9987 -— 1] 9978 -— 1] -9963 -— 2] -99389 — 3] 19-75 
4-75 | 20-00 | -9999° — -.9999- — | -9997 — | -9995- - | -9990 —]| 99838 -1 | ‘9971 — 2] 9951 — 2] 20-00 
a } 20:25 | -9999% — | -9999° — | -9998 — | -9996 — | -9993 - | 9987 — 1) 9977 -— 1] 9961 — 2] 20-25 
st 20-50 | -99997 — | -99999 —]| -9999- — | -9997 — | -9994 — | -9999 - 1] -9982 -— 1] 9970 —-— 2] 2050 
= 20-75 | -99998 — | -99995 — | -9999- — | -9998 | -9996 — | -9992 —| 9986 -—1] 9976 — 1] 20-75 
230 | | | 21-00 | 9999"  — | -o999° + — | -go992 9998 9997 —| 9994 —| 9989 — 1] 9981 — 1] 21-00 
600 21-25 | -9999 — | -99997 — | 99991 — | 9999- — | -9998 — | 9995+ — | -g992 — | 9985+ - 1 | 21-25 
i | 
6-25 21-50 | -9999° — | -99998 — | -9999% — | -99991 9998 — | -9996 — | -9994 — | -9989 —-— 1! 21-50 
6-50 21:75 | -99999  — | .gg9ggs — | -99997 - | 99999 — | -9999 — | -9997 - | -9995- — | -9991 — | 21-75 
pag bd 22-00 | -9999° — | -9999° — | -99998 — | -99995 — | -9999° -9998 9996 — | -9993 — | 22-00 
7-00 22-25 [1-0000- — . gnege — | -99ygs - | -99998 - | -99992 — | -9998 | -9997 — | -9995- —] 22-25 
= 22-50 99999 — } -9999° | 99997 - 99994 | -9999 — | -9998 — | -9996 — | 22:50 
rik 22-75 1-0000- — } -9999° | -99998 9999 — | -99991 — | -9998 — | -9997 — | 22-75 
7-90 23-00 -9999° — | -99998 - | -99997 — | -9999* — | -9999- — | -9998 — | 23-00 
8-00 23-25 10000" — | -99999 — | -99995 — | .99995 — | .99991 — | -g998 — | 23-25 
8-25 ) 23-50 *9999° — | -99998 — | -99998 ae -99998 — | -9999- — | 23-50 
8:50) Ff | 93.75 | 9999  —| -9999°  —| .99997  —| -99995 — | 99999  — | 93:75 
8-75 24-00 |10000- — | -9999° — | -99998 — | -9999% — | -99997 — | 24-00 
9-00 24-25 | | 9999 —! -9999°9  — | .99997 — | .g999¢ — | 24-25 
9:25 24-50 | 1-0000- — | -99999 — | -99998 — | -99998 — | 24:50 
24-75 } | -9999° — | -99998 — | -99997 — | 24-75 
9:50 25-00 | 999°  —| 99999  — | .g9998  — | 25.00 
9-75 25-25 | | '1-0000- =— | -g999 + — | 99998 — | 25-25 
10-00 
in 25:50 | ‘9999 — | 99999  — | 25-50 
10-50 25-75 | 1-0000- —| -9999%  — | 25-75 
10-75 26-00 9999" = — | 26-00 
aan 26-25 | 9999 = — | 26-25 
7 26-50 1-0000- — | 26-50 















































ON CORRECTIONS FOR THE MOMENT COEFFICIENTS OF 

FREQUENCY DISTRIBUTIONS WHEN THE START OF THE 

FREQUENCY IS ONE OF THE CHARACTERISTICS TO BE 
DETERMINED. 


By E. S. MARTIN, B.Sc. 


1. Introduction. In certain statistical data, for example, in cases of disease 
incidence in infancy, wages, incomes and house-valuations, the exact point at 
which the frequency commences is not evident. Thus the number of cases of 
Whk«oping Cough under one year of age may be recorded, but it is very unlikely 
that any of these occurred under three months; the number of houses valued for 
income-tax purposes under £20 per annum is published, but no house is actually 
valued at nothing per annum; the number of divorces occurring wh the first year 
of marriage has been given, but it is clear that there is a certain minimum time 
which the legal formalities must take. Methods hitherto devised for fitting curves 
to such distributions have assumed the frequency to start. at “zero,” but this 
assumption may not only give poor results for the moments, but, in the case when 
the first frequency is less than the second, may make the distribution appear 
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non-asymptotic when it is really asymptotic. The object of this paper is to 
investigate the possibility of determining the start of the frequency by means 
of an auxiliary curve and to examine what improvement in the evaluated moments 
of the distribution is thereby obtained. 


2. Let the histogram of the first few sub-frequencies be as in Fig. 1, where 
the range of the first is X4o and the ranges of the others are ho. Let there be 
p sub-ranges taken from «=0 (at A) to #=a,. As in the paper by Eleanor 
Pairman and Karl Pearson*, let Z= [*yde, and assume that Z'is given in the 


neighbourhood of A by the Sth order parabola 


OS =) (Z) (z) ry. 
Z N+ajp +ee(7 + C3 zs +, ra +65 (7 ’ 
where N is the total frequency+. The c’s are now determined im terms of A by 
fitting the parabola to the first five frequencies. When 
@=NXho, Z=N—- 7, 


a=(1+A)ho, Z=N — nm — nz, eic., 
whence 


— Ny = ACy + A2Ce + A2Cg + A*Cg + APCS 
— ny —ng=(1+A)ey+ (1+ AP ce + (1 + A)cg + (1 +A) Ca t+ (1 FAP Cs 
— Ny — Ny — Ng =(2+A) 1+ (2Z+AP? cot (2+ AP eg +(2+A)*eg t+ (2+ APO; | 
— ny — Ne — Ng — Mg =(3 +A) G+ (3+A)P co + (B+A)> cg + (3 +A) eg +(3 +APC; 
— Ny — Ng — Ng — Ng — Ns =(44+A) Cr+ (4 +A)? Cot (44 APP 3+ (44+A)*ee + (44+AP 5} 














acael (I). 
If we put 
A=p—2, N=—M, MtN=—M, Mm+Nt+ N= Mz, 
NtNgtNgtN=M, M+ Ng tN t Nt Ns = Ms, 
the solution of these equations is 
2_] 2 +1)(W?-4 
he, = — AH = wet hn 4 Gt Oe me 
w—2 wl 
2—1)(u? — 4 —1)(u?—4 2_1)(u—-2 
_g#-De- Pere | ) (uw? — ) tg — 2 )(u Ri 
& w+ wt+2 
2) (u? —1 34 3u?-8u~-4 
jas y es )(w +m ae Y. <= coe Mg 
p—2 w—l1 
4u3 — 10 4u3 — 3u? — 8u + 4 4 — 2) (22 — p—1 
4§—-— oe Mg — 4 = 2 Se e al E Pie 
w+ p+2 


* Biometrika, Vol. x11. pp. 231—258. 
+ In the paper by Pairman and Pearson the form 


. . x a\? «\* z\ (z\ 
Z=N [a+a(Z)+e (Fz) +e(F) +ea(i) ra(Z) ] 


was taken, but the above form is chosen here as it dispenses altogether with the use of ‘* proportional 
frequencies.” 
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67 + 6u—-1 2 a 
24c3 = — belo At mei = 8 
p-2 p-l 
6u2— 5 “Vee eee es 
-6 z oy oy ‘ t4 a Gp - Re 
pt+l pt+2 
4p + 2 1 2 
24e, = 44 * oe on + 6 te 4° eae st Ms; 
B— 36: # | 2 
24¢, = — my + — m ai + m fetes SF m 
5 ee ai gt eel eee 


The coefficients in these equations have been evaluated for values of \ between 
OQ and 1. On substituting back for the m’s in terms of the n’s, the c’s are obtained 
as linear functions of 1, ng, v3, m4, m5. The coefficients in these expressions are 
given to twelve places of decimals in Table ITI. 


Now let my,’ be the sth moment of the first frequency about 0, ie., about 
x = Xho, as given by the auxiliary curve. Then 


Ah 
Mb” = [ ‘ y (a — Aho)’ dz, 


~ 0 
whence the first four moments are 

My pa = ho ($e So ; AF Co + 4r4c3 + trP cq + 1 r®¢s5), 
a7 5 

nye =— he? (4 r3 Cy + \ rn C2 + ao » C3 + HAS cy + aA" es), 
a? 4 > 

Maps = ho? (4 4c, + Wo APCz + YoASCs + gsA7 Cat ge APCs), 

ny pes” = ho! (°c + Js rs Co + 3)5 nr? Cg + gy r® Ca + whe »? C5). 


With the help of Table III, these may be expressed as linear functions of m1, 
Mz, Ng, Na, Xs; the coefficients are given in Table IV. 


{&Z 
ad, = ho (aes ). Ahy 


gives the abruptness coefficients at #=2Aho as linear functions of the true 
frequencies. These are 


(ty = Cy + 2Ace+ BA%cg+ 4A8cg+ = 5A4Cs, 


Next, the formula 


de= Qco + 6rc3 + 1202+ 2OATC;, 
dg = 6cs + 24Arcqg + 60A%C;, 
(4= 24cg + 120Ac;, 
a5= 120¢;, 


and again using Table III, the abruptness coefficients were calculated and are 
given in Table V. 


Now suppose the raw moments of the remainder of the frequency, when ny is 
excluded, are calculated about O, using a sub-range h. Let them be (WV — 1) 11'", 
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etc., and write po=h/ho and a,’ =a,po°. Then the corrected moments of the dis- 
tribution about 2 = Ah are as follows: 

Nya’ =(N —1y) 4" + Poh (aa — dy as! + aebyp ds’) + apa” 

N pa! = (N — ny) (v9'” — pyh®) — phy h* (a2’ — Sp aa") + maps” 

Nps’ = (N — ny) (v3'" — fhP ny") — poh? (ay' — Gye" + shy as’) + mp3” 

Nya’ = (N — ny) (va — hh vg!" + ah gh) + phe ht (ae! — Shaa’) + mips” 
provided the distribution be not abrupt at the other end; in that case the usual 
corrections for abruptness must be applied for that terminal. 


..(ID, 


If h = ho, then a,’ = as, and the equations may be simplified by putting 


1 a”? 
Ki= ye (G— gods + geyq 4s) + eo? 
K a . 5 ) 1 ” 
he= — zhy (4a — zoe) + pas » 
eee . ; 1 ” 
es a (a — 63 dg + xt5 4s) a 8 mes , 


' 1 ” 
Ky = rhe (ae = wy Ma) + x Nypa , 


whence the corrected moments are 


Nyy’ = (NV = N4) yy!" + hKy \ 

Nos! = e ny) (a — jh?) . h?Ke q ss a (IIL), 
Nuys’ = (N — 14) (v3'" — 4h? 1'") + BK | 
Nya! =(N — 1) (vg — AP vg!” + sigh) +1 Kia, 


The values of the K’s are given in Table VI. 


3. We now come to the most troublesome part of the problem: namely, to 
determine from a given set of frequencies the best value of \. Three methods will 
be proposed. 

lst Method. Following on the lines of Miss Pearse’s paper on asymptotic dis- 
tributions*, we may select \ by making our parabola fit the sixth frequency ng. 
When #=(5+A)ho, Z= N — ny, — ne— ng — ng — 5 — Ng and on substituting in the 
equation of the parabola we get 

Ng = — Ny — Ng— Ng — Ng— 25 —(H+A)G—(H +A) Ce 
—(5 4+ reg —(5 + A)*eg— (5 +A Os, 


giving 7 as a linear function of nj, ne, ng, Na, Ns, Whose coefficients are given in 


Table II. 


In order to select X in any given case, the values of ng are calculated from this 
table for various values of \ and that value of A is chosen which yields a result 
nearest to the given ng. 


* **On Corrections for the Moment Coefficients of Frequency Distributions when there are Infinite 
Ordinates at one or both of the Terminals of the Range,” Biometrika, Vol. xx4. pp. 314—355. 
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This method does not always work very well, thus (a) the value of \ so found 
sometimes differs widely from the true value, (b) the table may not give any value 
of i, ie., there is no auxiliary parabola, for which 2 is less than unity, which fits 
the first six frequencies, (c) the table may give two values of X, i.e., there are two 
such parabolas. 


2nd Method. If it is expected that the final frequency curve will rise abruptly 
from the a-axis, we may, instead of making our parabola fit the sixth frequency, 


impose the condition that its initial ordinate shall be zero. Now since y=— _, the 
ordinate at «= 0 is — i Henze we may use Table III to find that value of \ which 
0 


makes cy most nearly equal to zero. This test, while not always giving an accurate 
result, has been found in several cases to give a better result than the first test, and 
it has not yet in any case been found to give a double value of X. 
3rd Method. The poor results which the above tests sometimes give are no doubt 
due to the inadequacy of our parabola to represent various types of curves. 
Accordingly. for the purpose of choosing X a new type of auxiliary curve may 
be proposed. 
(i) If Z= N + Aa? be taken as auxiliary curve*, then Z contains three unknowns, 
A,qand d. We may hence fit this curve to the first three frequencies. We have 
— = ANY, 
—m —NM=A(1 +d), 
— Ny — Ng— Ng= A (242)? 


Writing My=™, M+N=M, Ny+Ng+N3= Mz, 
we get, eliminating 4, 
mg (- + “ Mg (; + x" 
m \ Xr Me 1+ A/ ’ 


Me i 1 +X 
: log (“*) _ leg ( M ) 
whence (”) = 4 e = s ; 
8 Me 8 1+2X 
is the equation from which to determine A. Writing 


(™*) | (" + *) 
8 my, ads 3M, 7 °8 xX i sly 


_—_—_ = -and ——$——. = —— 
AW 2 , 
“GW 


we have calculated the values of the latter ratio for various values of \, the results 
being given in Table I. All the computer has to do, therefore, is to calculate the 








value of Mh from the given frequencies and, referring to Table I, select that value 
8} q § 





3Me 
° gly . 3M 
of X for which Z,, 18 most nearly equal to the computed value of Me" 
she sills 


* «x stands here for x/hy. 

















E. S. Martin 17 


If q also be required, we have q = = and the appropriate value of sZ, can be 
shy 


ol 
gle 3 
(ii) Now three frequencies will often be insufficient from which to determine 2, 
especially if ho be large. To get a better result we may fit the curve 
Z=N+ Axte™ 


to the first four frequencies. Eliminating A, g and a from the resulting equations 
by successive divisions we get 


mM? (1+)? ) 
log (= =) log \\ (242) 


mse \ (2+ r-* _) ; 
log (me) os laaneen} 


taken from the column adjacent to 








aM, aly 
or = = . 
aM, ale 
M. 
4 
and q=*>. 
aly 
The values of * L. and 4/, are given in Table I and the procedure is as for three 
442 
frequencies. 


(iii) Similarly, fitting Z=N + Ante" 
to the first five frequencies, we have 
4 (“= ms) ja+ AP (3+ I 


my ms?) _ °8 } rn(2+2r)8 


ms* Ms B: (2+A)®(4+A))’ 
log Se =) log a +A)(3+ >t 





ae 
544155 #41 
- ie aha’ 
and q= a 
§#41 


The values of my and ;Z, are given in Table I. 
sLe 


(iv) If the curve Z=N + Agierrrrrrers 
be fitted to the first six frequencies, the equation for X is 
a ae 4(: 4 
(= m4 ) log | (3 +r) 








MzMs m5) _ A(2+A)*§(44+2) 
log (SEE) og { EE 
me ma? Me ((1 +A) (3 +A)8(5+A) 
Mi gla 
or sa es 
eM, gla’ 
and q =. eM, 


ela” 
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The values of ola 
ola 

(v) If the curve ZH N + Aaterrtorrex+dat 
be fitted to the first seven frequencies, the equation for is 


es = mz4)® me ‘ {c +r) (3 + AJ (5 + A)) 
© \my mg! m;°/ r r(2+ APO (44 r/P 


i ) i 4 (2 FAP (4+ ro (6 + “I ; 


and gf, are given in Table I. 





ma m4! mee °8 \(1+a)(B +A)" (5 +r? 

M, iL 
2 ee 

= Mz = zhe’ 
M, 

and a 
ze 

L 


1 ° ° 
L and 7, are given in Table I. 
7<* 


The values of 

This method, though apparently rather elaborate, is simple in application. If 
the logarithms of m,, me, mg, m4, m5, Mg and mz, are found, the five values of A 
corresponding to 3, 4, 5, 6 and 7 frequencies are quickly determined*. In any 
theoretical distribution it will be found that the more frequencies are taken, the 
more nearly does the corresponding value of \ approach the true value. In fact, 
among numerous theoretical frequencies that have been tested, we have not found 
one in which the test fails to yield the correct value of \ (to one significant figure) 
when the seven initial frequencies are taken. In experimental frequencies, however, 
this is not the case, as the values of My and Mz become more and more affected by 
the sampling errors as more frequencies are taken. Thus the value of \ given by 
seven frequencies may be an impossible one, while that given by four or five 
frequencies may yield good results. For this reason the present test may not be as 
efficacious as the previous tests in any particular case, but it has one advantage over 
thera; it can be used for either asymptotic or non-asymptotic frequencies. We have 
placed no limit on the value of g and if it is found that 0 < ¢< 1 then an asymptotic 
frequency is indicated. It may be remarked, however, that although the auxiliary 
parabola of the ng test is incapable of having an infinite initial ordinate, yet it 
can have a very large one and will yield reasonable results when applied to an 
asymptotic frequency, provided the degree of asymptoticity is not large. 

Our third method will be referred t as the “e test.” 

As an example consider the Type VI curve 

y = 33 x (160 pod , Starting at w= 4, 
Let ho = ‘2, X =°5, then from the formula 
x 12 7 
| yde= 10" (1 - Gon + Gaya) 
* We note that: ,M,= —log m, +5 log m, — 10 log my + 10 log m, — 5 log m, + log mg, 


and similarly for the other M’s. 
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we have the “frequencies 
m= 33,377, ns = 123,600, 
ne = 168,900, mg= 91,023, 
ng = 189,434, _%= 65,653. 


na = 160,671, 
Tables II and III give the following results: 


r Ng cy 
‘1 966,781 — 368,500 
2 276,218 — 171,696 
3 111,069 — 90,182 
“4 61,994 — 39,885 
5 51,108 — 3,641 
6 54,606 + 24,479 
‘7 63,513 + 47,148 
8 74,072 + 65,788 
‘9 84,638 + 81,248 
1-0 94,507 + 94,078 


These results are shown in Figs. 2 and 3. It is seen that the ng test gives 
(without interpolation) the two values °3 and 1-0 for A, while the c test gives the 
correct value ‘5. 


Applying our third test we have 


m= 33,377, log my = 4528 4473, 
Mg = 202,277, log mg = 5°305 9465, 
mg = 391,711, log mg = 5592 9658, 
mM, = 552,382, log m4 = 5°742 2395, 
| m; = 675,982, log ms = 5829 9351, 
| me = 767,005, log mg = 5°884 7983, 
| m, = 832,658, log mz = 5-920 4667. 


Taking in each case the nearest value of \ (without interpolation) from Table I, 
we get the following results: 
3M, “7824992... 
3M, 2870193 © 
| aM, _-495 4799 


3 frequencies. 





wey ) s. i = aomataen SG 59 = 

| 4 frequencies ‘My = 1377456 >” 71, A="4 
: M, ‘357 7343 

a *71eaS snl = . = ‘6967 = 5 

5 frequencies. “Me ~ 0761675 6967, r="'5. 
: M, :281 5662 

ey Pe 

8, = ~~” = 59875, ="5. 

6 frequencies “My 047 4218 = >? rA='5 

: M, ‘2341450 . a 

7 frequencies. “vw a 7246, rA='5. 


7M, 032 3137 


2—2 
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Values of Neg 
n Y=33 X\160)° x8 





120,000 


110,000 


100,000 


True 16 91,023 


990000 eet on 
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Values of C, 


6 
Y= 35 * (160) 2-8 








Fig. 3. 
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Hence the correct value of X is obtained by the use of five frequencies. 


The equation of the auxiliary parabola with the correct value ‘5 for X is 


“\2 
y = 18,206 + 1,513,518 (5) — 817,671 (5) 


3 4 
+ 176,087 (5) —14,187°94 (5) 
2 2 

The equations of the two parabolas given by the ng test are 


Pe ‘x s a\2 
A= 3, y= 450,911 + 771,550 (5) — 356,436 (5) 


3 4 
+49,760°9 (5) ~ 1,696-00 (5) 


2 2 
X=10, y=—470,389 + 1,656,578 (5) ~ 641,893 (5) 
\3 4 
+ 95,9146 (5) —5,146°46 (5) 


These parabolas together with the original curve are shown in Fig. 4. It will be 
seen that the parabola for which \ = 1:0 has, besides its incorrect starting-point, 
another serious fault; its first frequency 7 is composed of a negative and a positive 
part. 


As an example of an asymptotic frequency !et us take the Type III curve 


Take \ =°8, hhp= $9, then Akg ='4/'9. The first seven frequencies, from the 
Tables of the Incomplete T-Function, are shown below: 








Fo u I (u, p) n m log m 
4/9 t *365 4305 365,431 365,431 5°562 8054 
9/9 9 *620 7737 255,343 620,774 5°792 9335 

4/9 | 14 ‘770 3535 149,580 | 770,354 | 58866903 
19 /°9 1°9 *860 0470 89,693 860,047 5°934 5222 
2°4 /°9 2°4 *914 3798 54,333 $14,380 5°961 1267 
2°9 ,/°9 2°9 ‘947 4827 33,103 947,483 5°976 5715 
3°4,/°9 3°4 *967 7261 20,243 967,726 5°985 7524 


























Using the n column with Table IT, we have 
X='8, n= 88,314; 
. r='9, ne= 9,583. 


Hence, since ng is really 33,103, the ng test gives ‘9 as the nearest value of A, or 
‘87 by linear interpolation. 
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(t) y= 33x (160) x x8 





(2) Parabola with \=-S 
(c.ande tests) 


(a) Parabola with \= 3 
(ng test) 
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Using the log m column, we get 


3M, 2301281 


Mp ~ 093 7568 > = 2 
7 Ms ae joa = 2:9694, 
“ - he Be = 44942, 
OE = SOL 110) 5-408, 


7M, 009 4579 


»& => ‘8. 


Hence the correct value of \ is again obtained by the use of five frequencies. 
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The equation of the parabola with \ =°8, referred to its start, is 
4 2 
y = 1,263,615 — 919,415 (7) + 356,072 (=) 
0 


ho 
3 a 
— 72,5945 (7) +5,81189(F) , 
he ho 
and of that with \ =°9, referred to its start, is 
2 
y = 1,038,998 — 438,479 (7) +49,970°5 (7) 
ho ho 
a\* «\* 
+5,30522 (7) -1,10401 (7) . 
ho h 

These, together with the omginal curve, are shown in Fig. 5. It is seen that 
there is very close agreement between the curves for frequencies ng, ng, m4, M5 and 
that there is a good attempt at agreement at the start, especially in the parabola 
with the correct value of 2. 

The curves of our e-test would no doubt fit still better, and there is in fact 
some inconsistency in using these curves to determine the value of \ and then 
reverting to our original parabola. But it will be shown in the following examples 
that if the correct value of \ be determined, the corrections given to the moments 
by the parabola are usually adequate. Further, if the e-curves were used for 
correcting the moments, the K’s would have to be worked out ab initio for each 
example, owing to the impracticability of tabling for values of both and g. There 
is the further objection that with these curves the abruptness coefficients tend to 
diverge. We shall therefore be content to correct the moments by the use of the 
tables at the end of this paper, no matter how the value of \ may be determined. 


4. We will now investigate, in some further distributions, the extent to which 
the moments are corrected by the method of this paper. 


Example 1. Consider the curve y = 156 x 10% «(1 —)" from 0 to 1, which has 
a total “frequency” 1,000,000. We will take our first sub-range as ‘02 starting from 
«x =0 and the other sub-ranges as ‘06; thus \=}. The frequencies are as follows: 


7 Frequency x Frequency 
0—-02 26,951, 38— 44 11,965, 

02—--08 252,423, *44—-50 4,264, 
‘08—'14 281,978, 50— 56 1,302, 
14— 20 205,002, 56— 62 330, 
‘20— 26 122,555, 62— 68 66, 
°26—'32 63,781, ‘68— 74 10, 
32— 38 29,372, ‘74— 80 a. 


We have first to find X. 

(i) The ng test. Table II gives: 
when A="2, ne= 76,551; when A="3, nme=— 2,153; 
when A="6, ng = 53,936; when A="7, ne= 81,889. 


Since ng is really 63,781, this test gives the two values ‘2 and ‘6 for A. 
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(ii) The c, test. Table IIT gives: 
when A='8, cy = — 22,007; when AX="4, cy = 30,348. 
Hence ¢; is most nearly equal to zero when \ =°3, or ‘34 by linear interpolation. 
This is a good result. 


(iii) The e test. We have 


m= 26,951, log my = 4430 5749, 
Mz = 279,374, log mg = 5°446 1860, 
Ms = 561,352, log ms = 5°749 2352, 
Mm, = 766,354, log mg= 5°884 4295, 
ms = 888,909, log mg = 5°948 8573, 
Mg = 952,690, log mg = 5°978 9516, 
M7 = 982,062, log mz = 5992 1389. 
Whence, using Table I we have 
Ms _ 3.35131, x=15-, 
3Me 
M. 
aut = 2 5 ="t 
“7,7 424511, X= 31, 
M. 

oy _ x. = 954 

Ms 56104, rA='35t, 

oh 7-37 97, X="34, 

Me 

a 9291, r= 333. 

Me 


The values of X are obtained by linear interpolation. It is seen that 4 frequencies 
give a value correct to 1 significant figure while 7 frequencies give a very exact value. 
Let us take X=°3 and test Table IV to find the accuracy with which our 


auxiliary curve gives the moments of the first frequency about = 02. We get 
the following results: 

















] 
From Table | True value 
2 py” — 185°36 — 186°44 
pe” + 1°868 + 1°906 
Ny ps” _ "022 25 = *023 21 
1 py” ae “000 2914 | + °000 3126 
{ 





It is seen that the first moment is in error in the units figure, but since the total 
first moment of the whole curve about «=‘02 is 122,857}. this error will not 
seriously affect the results. Similarly the total second moment about #= ‘02 is 
23,2574, and hence the error in the second moment wil! not affect the sixth 
significant figure. Also since the total third and fourth moments are respectively 
5592 and 1593°572, we should again get at least six figures correct on adding the 
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corresponding moments of the first frequency. We can therefore say that the 
moments of the first frequency as given by Table IV are sufficiently accurate. 


We will now find the corrected moments of the total frequency about «=-02, 
taking h=1 as our working sub-range. With the notation of Equations ITI (p. 15) 
above, 


(N — ny) vy" = 2,063,961'5, (N —%) v3" = 26,400,593-4, 
(N — ny) r9"” = 6,538,164-25, (N — ny) v4'” = 126,209,278°6. 
The values of the K’s, obtained by using \ = ‘3 in Table VI, are 
Ky =— 16,2200, Kz =+ 46059, 
Ke=+ 325755, Ky=— 19550. 


The corrected moments are therefore 
Nyy’ = 2,063,961°5 — 16,220°0 = 2,047,742, 
N pe’ = 6,538,16425 — 81,087-42 + 32575 = 6,460,334, 
Nuys’ = 26,400,593°4 — 515,990°4 + 4605°9 = 25,889,209, 
Nyy’ = 126,209,278°6 — 3,269,082'1 + 28,380 6 — 1955°0 = 122,966,622, 
whence py’ = 2047 742, bs’ = 25°889 209, 
pa’ = 6'460 334, yg’ = 122-966 622. 
Transferring to the mean as here given, viz., 2°047 742 from the start of the second 
frequency, we obtain 
fea= 2°267 087, 
us= 3375 294, 
pug = 20°697 620. 
Now the true moments about the mean are easily found from the equation of 
the curve, and can be brought to the working scale by dividing yw, by (06). They 
are shown in the following comparative table: 























By present method | - True value °/, error 
Mean from start mf 2-047 742 2-047 619 +-006 
second frequency 
be 2°267 O87 2°267 574 — °021 
Ms 3°375 294 3°374 366 +028 
py 20°697 620 20°699 890 — O11 











It is seen that the percentage errors in all four moments are very small. 

It is of interest to compare these results with those which would be obtained 
by supposing the first frequency group on the same base as the others and finding 
the moments about its supposed start. We shall use the abruptness coefficients 
as given in the paper by Pairman and Pearson cited above. The raw moment- 
coefficients are 

yy’ = 3°050 486, vy = 53°183 388, 
ve’ = 11°645 874, vy’ = 280°271 217. 
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These, if taken without correction, give the following results: 
Mean from start of second frequency = 2°050 486, error = + 1407, 
ve= 2340409, error = +3°2127, 
vg= 3379081, error=+ 140%, 
vg= 21°776 860, error = + 5°203 7. 
Applying Sheppard’s corrections we get (v," and vs being uncorrected) 
ve’ — y= 11562 541, 
v3 —}'= 52-420 767, 
va — $ vq +555 = 274 477 447. 
With these corrections alone, we get 
fa= 2257 076, error =— 4637, 
#4 = 20°635 988, error= —*3097%, 


which are a good improvement on the raw moment results. 


Now the usual abruptness corrections * work out to be 

H, = — ‘013 828, H;=— 003 662, 
Hz = — 004 091, H,=— ‘003 872, 

whence the fully corrected moment-coefficients are 
ja’ = vy’ — Hy = 3064 314, 
He’ = ve’ — 5 — He = 11°566 632, 
Ms’ = 3’ —} vy’ + Hg = 52-417 105, 
fa’ = v4 —$ v9’ + sip + Ae = 274473 575. 


Hence H, acts the wrong way and gives a worse value for the mean than th» raw 
first moment: 


Mean from start of second frequency = 2064 314, error = + ‘8157 : 
Referred to this mean, 
fe= 2176612, error =— 4011 is 
Hs= 3633 669, error = + 7°685%, 
#a= 19131 702, error = — 7576 %. 
These are obviously much worse than the raw moments alone. 


Thus we have the remarkable result that Sheppard's corrections, which are 
obviously inapplicable since there is not high contact at the start, give very good 
results, while the usual abruptness corrections give very bad results, Now it is 
shown in the paper by Pairman and Pearson that the reverse is often the case, and 


* F. Sandon, Biometrika, Vol. xv1. pp. 193—195 (with an error in the value for 4,’), and Tables for 
Statisticians and Biometricians, Part 1. p. exciii, Introduction (error corrected). 
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hence one must conclude that the present result is due to the incorrect value given 
to the base of the first frequency. This in fact has cancelled those errors in the 
moments as given by Sheppard’s corrections and in the raw first moment, which 
are normally corrected by the abruptness corrections. To prove this, one may omit 
the first frequency and examine the moments of the remainder about its start. 
The results are as follows: 


Moments of the remainder of the frequency about its start. 








‘ . | With abruptness 
True value Raw moment | With Sheppard pera  ey 
(N— 7) py” 2 050 726 2 063 961 2 063 961 2 049 890 
(V — 27) po” 6 459 788 6 538 164 6 457 077 6 459 401 
(V— 2) ps” 25 888 986 26 400 593 25 884 603 25 889 420 
(NV — 2) py” 122 960 760 126 209 279 122 968 578 122 966 457 























Thus we see that when the first frequency is excluded from consideration, the 
raw moments are improved by Sheppard’s corrections, but the abruptness correc- 
tions make a still further improvement in all four moments. Hence the previous 
unusual result was due entirely to the effect of the first frequency. 


But obviously Sheppard’s corrections alone cannot always be relied on to give 
good results where their use is not justifiable, and since using the abruptness cor- 
rections for full first range may actually be worse than the raw moments, and give 
errors of more than 7°/, the need for our present method is clearly demonstrated. 


Lastly we may inquire whether the value ‘3 taken for \ can be improved. Using 
the four moments obtained by the present method and fitting a Type I curve in 
the usual way, we obtain the following equation, reduced to the original scale: 


4 3( ve \ 85, ay \ 10-9885 

Distance from start of curve to mean = 2°3793 x ‘06, distance from start of second 
frequency to mean = 20477 x ‘06, hence \ = °3316 and the base of the first frequency 
is ‘0199. 


Hence in fitting a curve it is best not to fix the start at the value given by our 
tables, but to fit with four moments. 


Example 2. The curve in the last example was one in which our tables could 
reasonably be expected to give a good result, as its equation could be expanded in 
the form of the auxiliary curve. We will now take a curve whose equation cannot 
be so expanded, 

y = yo 1 (1 — ay. 


If N = 108 as before, yo= 7,611,605°925. Take h =08, \=°5, then A= "04. 
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The “frequencies” are as follows: 














) 
| 
x Nn. | 
O— -04 | 180,698 
*04— °12 | 311,260 
"12— -20 212,991 
-20— +28 134,844 
-28— °36 79,796 | 
*36— +44 43,790 
*44— +52 21,913 
‘52— +60 9,726 
-60— “68 3,663 
*68— °76 1,082 
"76.— °84 216 
-84— +92 21 
*92—1°0 0 
tee | | 
Total 1,000,000 





(i) The test. Table II gives: 
when A= "4, ng= 198,469; when A="5, nge=— 19,300. 


Hence, since 7% is really 43,790, X=°5 is the nearest value and this is the correct 
value. 


(ii) The c, test. From Table IIT we have: 
when A ='8, cy = — 66,710; when X= °9, cy = + 6,469. 
Hence this test gives X= ‘9, a bad result. 
(iii) The e test. The results are as follows: 


Using 3 frequencies, ’ = *24, 


aS o r= °47, 
»” 5 » xr = "52, 
” 6 » A= “50, 
ee J r=°50. 
Hence again four frequencies are sufficient to give the value of X correct to one 
figure. 
Taking \ = ‘5 and carrying out the work as before, with h = 1 as working unit, 
we get 
(N — ny) vy/" = 1,511,250, (N — ny) vg’ = 19,012,399°5, 
(N — 1) ve’”’ = 4,643,887°5, (N — ny) v4'" = 92,962,029. 
Ky, =— 74,940°3, Ks= + 33180, 


K.=+ 14,6505, Ky= + 2832°7. 
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Nyy’ = 1,511,250 — 74,940 = 1,436,310, 
Nye’ = 4,643,887°5 — 68,275°2 + 14,650°5 = 4,590,263, 
Nuys’ = 19,012,399°5 — 377,812°5 + 3318-0 = 18,637,905, 
Nua’ = 92,962,029 — 2,321,944 + 23,896 + 2833 = 90,666,814. 
py = 1°436 310, bs’ = 18°637 905, 
ua’ = 4590 263, us’ = 90°666 814. 
The last three moments are now referred to the mean and give the following 
comparison with the true values: 

















True value By present method Error °/, 
“a 
Mean from start of 1-436 620 { 1-436 310 _ 099 
second frequency = \ 
He 2°525 587 2°527 277 + °067 
Bs 4°788 490 4°784 959 —*074 
ae | 27-616 548 | 27-627 739 +°077 





Let us again compare these results with those obtained when the first frequency is 
placed on the same base as the other frequencies, taking the origin at the supposed 
start of the first frequency. We get 

v;’ = 2420 901, v3 = 38319 701, 
ve, = 8530 864, va = 203-750 548. 
These if taken without correction give: 
Mean from start of second frequency = 1-420 901, error = — 1:0947, 
ve= 2670102, error = + 57227, 
vg= 4739218, error =— 1029 y a 
va = 29°616 321, error = +7241 2: 
Applying Sheppard's corrections, 
yy = 2420901, 
ve —~y= 8447 531, 
vs —}vy'= 37°714476, 
va — dveq' + gig = 199°514 283. 
Taking these as true moment-coefficients and transferring to the mean, we get 
e= 2586769, error =+ 2-422, 
Bs = vg = 4739 218, error =— 10297, 
fa = 28°310 447, error = + 2:5137%. 
Hence Sheppard’s corrections make a considerable improvement in the second 


and fourth moments, but the errors are still large; the good results which they gave 
in the first example were due to a good luck which is not repeated here. 
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The Pairman-Pearson abruptness corrections are 


Hy, = — 004 598, Hz = + 000 216, 

Hz = — 006 360, Hy= +005 776, 
whence the fully corrected moment-coefficients are 

py’ = 2°425 499, fs = 37°714 692, 

fe’ = 8453 891, pa’ = 199°520 059. 





The mean is thus 1-425 499 from the start of the second frequency, the error 


being — ‘7747. We get also 
Hea= 2570846, error = + 1°7927, 
Hg= 4738621, error =—1:041%, 
Ha = 28189 347, error = + 20747. 





These results are an improvement on the results with Sheppard’s correction 
only, but are considerably worse than those obtained by the method of this paper. 





Example 3. We will now try our corrections on an asymptotic frequency 


distribution. 


For this purpose let us take the Type III curve 











10° 
—*2 pz 
= —— a *¢%, 
J=Pp (8) 
Taking h=1/'8, X=°5, we get, from the Tables of the Incomplete T'-Function : 
| 
x u I (u, p) Frequencies 
4/8 5 *466 3930 466,393 
15/8 1°5 “807 4006 341,008 
2°5 /°8 2°5 *926 7582 119,358 
3°5 /°8 3°5 971 5767 44,818 
4°5 /'8 4°5 *988 8457 17,269 
55/8 5°5 995 5913 6,746 
6°5 ./*8 6°5 *998 2487 2,657 
7°5/'8 75 -999 3018 1,053 
8°5 4/8 8°5 *999 7208 419 
9°5 /°8 9°5 “999 8881 167 
10°5 V/°8 10°5 "999 9551 67 
115 /°8 11°5 “999 9820 27 
12°5 /°8 12°5 “999 9927 11 
13°5 J/*8 13°5 “999 9971 4 
14°5 /°8 14°5 *999 9988 2 
15°5 J/*8 15°5 “999 9995 1 
Total 1,000,000 


























(i) The vg test. Table IT gives: 
when X="6, ng=101,244; whenrX="7, ng=— 133,346. 
Hence, since 1 is really 6,746, this test gives \ = “6. 











Seer 3 





E. S. Martin 33 


(ii) The e test. This test gives: 


Using 3 frequencies, \ ="1, 
ene a r='3, 
mS r r~A=4, 
as. - r='5, 
re P rA=°5. 


Hence six frequencies give the correct value of X. 
We find, with h=1 as working unit, 
(N—m)4'" = 579,542°5, (N — ny) vs"" = 3,632,652°625, 
(N — ny) ve"” = 1,154,247°75, (N — mq) v4!” = 15,517,049°4375. 
With »=°5, Table VI gives 


Ky =— 186,6719, Kg=— 4,253°5-, 
Ke=+ 38,430°9, Ky=+4+ 13,7941. 
Applying the full corrections we get 
fa’ = “397 8706, Ms = 3°483 5135, 
fa’ = 1148 2114, pa’ = 14969 2831, 


whence the moment-coefficients about the mean are 
He= 989 910, 
Hg= 2238 961, 
Ha = 10°440 733. 

Now h has been taken as 1, whereas it should have been "8. Hence we multiply 
us by (/*8)* and get the following results: 

pa’ = 355 8663, Hs = 1°602 070, 
fe =*791 928, pa = 6682 069. 

Now the true mean is ‘8 from the start of the curve, or ‘8 — Ak = "352 7864 from 
the start of the second frequency. Thus the error in the mean as here obtained is 
+087 %. 

The true values of us, ws and yg are 8, 16 and 6°72 respectively. Hence the 


errors in our values are — 1°01 %, + 0°13 °% and — 0°56 '% respectively. 


These results, while not as good as those obtained for non-asymptotic frequencies, 
are still very reasonable, especially when it is remembered that nearly half the total 
frequency is contained in the first group. 


We will again compare these results with those obtained by assuming the first 
sub-range to be equal to the others. 


On trying Miss Pearse’s method we find no value of g, that is, the method fails 
to indicate an asymptotic distribution. 


Biometrika xxv1 3 
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The raw moment-coefficients about the supposed start of the first frequency are 
vy’ = 1°346 346, vs = 9°425 9295, 
ve’ = 2'963 538, vq = 39°854 073. 
The uncorrected moments about the mean work out to be 
ve = 1150 890, vg = 2°336 990, va = 11°465 859. 
Reducing to the base h=¥/"8, we get: ; 
Mean from start of second frequency = °346 346 x ./°8 ="309 781; error =—12°2 ; # 


ve= 1150890 x ‘8 =-920 7120; error =+ 151%, 
vy= 2/336 990 x (x/8)*=1-672 214; error = +451 %, 
va= 11°465 859 x ‘64=7°338 150; error=+9°20 a 


So a EL 


The percentage errors given are those which would be made if these results were 
taken instead of the true moment-coefficients. { 
Now applying Sheppard’s corrections, we have 
wy’ = 1:346 346, 
pe’ = 2°880 205, 
ys’ = 9089 343, 
pa’ = 38375 221, 
giving, with base /°8, 
be = 1:067 557 x8 ='854 0456; error = + 6°76 Y, 
Mg = 2336 988 x (/°8)® = 1672 214; error=+ 451%, 
#4 = 10°893 331 x 64=6°971 732; error=+3°75%. 
The mean is as above, with an error of —12'27/. 


Hence Sheppard’s corrections make a good improvement in the second and fourth 
moments. 


Applying now the abruptness corrections on the assumption that hoa=h, we obtain 
H, = +032 0855, Hg=+:011 2586, 
H,=— ‘003 6942, H,=— 003 1652. 


Hence py’ = 14’ — Hy = 1°314 2605, and Hy has again acted the wrong way. The mean 
is now ‘314 2605 x /°8 = "281 0832 from the start of the second frequency and is in 
error to the extent of 20:3°/. We have also 


He’ = 2°883 899, pg’ = 9°100 602, pa’ = 38°372 055, 
whence, reduced to base ,/*8, the moment-coefficients about the mean are 
Pe= 1156 618 x ‘8 = "925 294; error = + 15°7 yo 


pg= 2270 212 x (V8) = 1624431 ; error = +153 Y, 
pa = 11°467 132 x 64=7°338 964; — error=+ 921%. 
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Thus three of these moments are actually worse than the raw moments. The 
results of this example are shown in the following table. 


6 
Moments of Asymptotic Curve y = rye 





Mean, from 





he La. Smet 











x=} J'8 He Bs My 
True value *352 786 *8 1°6 6°72 
Present method *355 866 °791 928 1°602 070 6°682 069 
Raw moments: ... *309 781 *920 712 1°672 214 7°338 150 
With Sheppard... *309 781 *854 046 1°672 214 6°971 732 
Assumption, h= ho *281 083 "925 294 1°624 431 7°338 964 

















in 1912: 


Under 1 year... 


1—2 years 


2-3 
3—4 
. 4—5 
5—10 
10—15 





”» 


»”» 


» 


”» 


» 


Total ... 


Example 4. Age incidence of Whooping Cough cases in Metropolitan Boroughs 


212 
+ 
344 | 1463. 
270 
210 
253 

14 


1730 


One case between 40 and 45 years has been omitted. 


Now here we have the difficulty, common in such statistics, that after the first 
five years the frequencies are given only for 5-yearly groupings. To get » from 
the ng test a knowledge of the frequency in the sixth year is required; further, in 
finding the raw moments omitting the first frequency, the sub-ranges must be 


equal, and this is impossible as the frequencies stand at present. We have there- 
fore to make an estimate of the frequency in the sixth year, and either arrange the 
remaining frequency in groups 6—11 years, 11—16 years, etc., or split it entirely 
into yearly groupings. In the present case, as there is not a long tail, the second 








method is feasible, and may be performed as follows: 


























36 Moment Corrections with unknown Curve-Start 


Let us take five frequencies 71, ng, ng, M4, Ns On equal ranges of 2 units, and fit 
the parabola 


[" yda = ax + bu? + ca® + dat + ex 
Jo 
to them. We get, taking the origin as in the figure, 


-3 
™ = | yda = 2a —16b + 98c — 544d + 28826, 
5 





Ne =2a— 8b+26c— 80d+ 242¢, 
Ng = 2a + 2c + 2e, 
na =2a+ 8b+26c+ 80d+ 242e, 
Ns = 2a + 16b + 98c + 544d + 2882e, k 
whence 
3840a = 9ny — 11672 + 21343 — 1164+ Ins, i 
384b = 5ny— 34ng + 34ng— 5nz, 
384c =—my + 12rg— 22ng+ 12ng— Jez, 
768d =—n + 2ne — nat sz, 


3840e = m— 4ng+ 6ng— 4nrg+ sz. 


We now divide ne into five frequencies 71, v2, vs, v4, vs On equal bases, getting 


[ 5 ( 399n, + 40542 = 1931n3 a 72914 aie 126ns), 
—11/5 1 

Vg= | _yda = =—3- ( 691 + 3849ng— 1086n3 + 349ng— 56g), 
We += 7eqpg (— 111m +8819nq— 41mg— 56rmq+ 147), 


ge et 76S 





25 (— 181, + 2599nzg + 1039ng — 401ng+ 6975), 








= 
= dx = 176n1 + 1804ne + 20193 — 6214+ 99n 
V5 ie y — (- 1 2 3 4 Ns). 


To apply these formulae to the Whooping Cough statistics, we suppose for this 
purpose that the frequency starts at 0 years, and we have the five frequencies 
1463, 253, 14, 0, 0 on equal bases. Then the frequency 253 divides into 101, 68, 
43, 26, 15. 


If we take the five frequencies 253, 14, 0, 0, 0 and apply the above formulae to 
split up the 14 frequency, unsatisfactory results are obtained, some of the resulting 
frequencies being negative. We accordingly take the same five frequencies as 
before and divide ng into 14’, v9’, vs’, v4’, vs’ by the formulae 


-3/5 
‘ihe i yde = ao 126n + 1029ng + 2794ng— 671ng+ 99ns), 


-1/5 1 
ne same P mi 
" J aps 1 = T5605 (— 56m + 849m, + 8289ng— 526mq+ 69ne), 
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,_ ft em 
Vs =|", ydz = 15625 ‘ 14m, — 18l1n_,+ 3459n3s— 18lng+ 14s), 


»_ [% 1 
y= | ydo= >= ( 69m — 526g + B2B9Nq+ B49 — 56s), 


5 


. 1 

F = -= ——_, =e 9 x 

| ww 5" iggas! A Sit 
These divide the frequeacy of 14 roughly into 7, 3, 2, 1, 1. 


We thus have the revised data: 


Under 1 year ... ~ i ao nae 212 
1—2 years ae kates nae an ats 427 
2-3 , sy ah < Sai ies 344 
34 , 270 
4—5 , 210 
56, 101 
6—7T , 68 
7—8 , 43 
8—9 , 26 
9—10 ,, 15 
10—11 ,, 7 
1l—12 ,, 3 
12—13 ,, 2 
13—14 ,, 1 
14—15 ,, 1 
Total ... a“ osm — ... 1730 


(i) The ng test. Table II gives: 
when A= ‘4, ng= 269°840; when X="5, ng= 30°555. 
Hence *5 is the nearest value for X. 
(ii) The ¢ test. Table IIT gives: 
when A='8, c; = — 33°903; when X= °9, cy = + 53°638. 
Hence this gives \ ="8. 


(iii) The e test gives the following results: 


Using 3 frequencies, X= °3, 
» §& re A= 4, 
ae ‘ Ke & 
"Sa. , = rA> 10, 
ae ij A> 15. 


We have to choose from these the best value of 4, i.e. the value which gives the 
best fit to the total frequency. We may reject, perhaps, in this case the values of 
® greater than unity, but there is no guide as to which of the other values is best. 
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Accordingly, curves have been fitted for the values ‘5 and ‘8, using four moments. 
In each case a Type I curve is indicated by the f’s and the value ‘8 gives a better 
fit than the value ‘5. 


The fitting for \ = ‘8 is, with the usual notation, as follows: 


About the start of the second frequency, 





(N — 1)vy'" = 3717, (N — ny) v3" = 8387325, 
(N — ny) ve"’ = 151715, (N — ny) v4!” = 570882875. 
Ky = — 99°9286, Ky; = — 37007, 
Ke = + 30°0474, Ky =+ 75620, 
whence, using the full corrections, | 
py’ = 2-090 793, ye’ = 8713 900, 
Mg = 47°942 369, Ha’ = 325°635 238. 
These give Me= 4342 485, 


Ms = 11°564 934, 
fa = 95°909 587. 
Ai = 1°633 320, Bz = 5-086 106. 
These values indicate a Type I curve. Fitting with four moments, we have 


._ 582 — 681 — 6 





a ~? — 20-222 
1 32, — 282+ 6 0 a 
® 3028830 
: ee s 
4(r+1)"? 


b=r J pal + 1) _ 35.0749, 


€ 





m, +1, m,+1 are the roots of the quadratic m’—rm’ + e =0, whence 


my, = 0°62899, Mz = 17°59328. 





bing 
a= ———_ = 1:217584, ag = 
m4 + Me my + Me 


= 34:05665-, 


_N mm" (m+ mat+2) __ aap, 
JO= DB (ma + mg) F (my +1) 0 (mg +1) — 





Mean to mode = 2+ 2 ©2183 905. 
2r—2 pe 


The equation, referred to the mode, is 


‘ie ; @ 0.62899 : © 1759328 
= 5880 ( + [217 =a) ( ™ sa05sds) 


This curve is drawn in the diagram (Fig. 7). 


























hay, 
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WHOOPING COUGH 1912. 


xc 0-6290 xc 5933 
yn tes ss{l+ Far) (- stisce) 











25 | CASES. 







































































PE Re a 
0: Fo Bo SB SE OS Se SS OB Se ae a Se ee ee ee 


Years Fig. 7. 





To find the theoretical frequencies, the mid-ordinates of the groups (except the 
first) were caiculated, and the frequencies were calculated from the formula 
5760 


The first two frequencies were calculated from the formula 


ad b\™2/ar’\ matt 1 me (a’\ _ me(m_—1) (5) | 
! =: 7 = = = Bui > be ee See 
I. yda' = yo (;.) (<) “a m+1 m4+2 (5) ¥ 21 (m+ 3) \b 


N, = {5178y, + 308 (ypa + Yrua) — 17 (Ya + Yr+2)}- 
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obtained by expanding the equation of the curve referred to its start as origin, 
and integrating. The results are as follows: 











n observed n theoretical Contribution to x? 
212 213°6 ‘Ol 
427 418°2 "19 
344 363°5 1°05 
270 268-0 “01 
210 182°1 4°27 
101 117°1 2°21 

68 72°1 *23 
43 42°8 “00 
26 24°5 “09 
15 13°6 14 

7 7°3 

3| 3-3| 

2+14 1°9} 14°4 01 

1 0-9 

1 0°4 

x2=8'21 

















For goodness of fit, this gives P= 61. The goodness of fit to the original seven 
groups is given by P = "36. 

Now d = mean — mode = 1°623 905, 
and AO (Fig. 1) = base of 1st frequency 

=y—- py’ +d= ‘750 696. 
Now this indicates that the value ‘7507 for X would be a better one to take; if this 
is done, and the K’s evaluated by linear interpolation in Table VI, and a curve 
again fitted by four moments, a new value of X is found. 

The question naturally arises whether the series of values of X found by 
successive fittings tend to a limit; further, is this limit eventually attained if we 
start off with a bad value of X, e.g. the value ‘5 obtained from the m test? This 
point has been investigated, with the following results: 

¢; = 0 gives dy ='8 (nearest). 
Ist fitting with Ay="8 _— gives A» = ‘7507, 


2nd fitting with A,= "7507 gives As = ‘7421, 
3rd fitting with A3=°7421 gives 44 = °7406, 
4th fitting with \4= 7406 gives \, = *7403, 


5th fitting with As = "7403 gives Ag = *7403. 
The vg test gives Ay =*5 (nearest). 

Ist fitting with 4y="5 — gives A» = 6586, 

2nd fitting with Az = "6586 gives As = "7198, 

3rd fitting with As = 7198 gives \4 = ‘7367, 

4th fitting with \4=°‘7367 gives A; = ‘7396, 

5th fitting with A; =°7396 gives Xg = *7402. 
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Hence we may conclude that the values of \ obtained by successive fittings tend 
to a limit, here “7403, and this is the case even if a bad value is originally selected. 


These data therefore indicate that Whooping Cough starts at -2597 year of age. 


The curve fitted with the value ‘7403 for \ should thus give the best fit 
possible; the details of this fitting are: 


K, =—98°7699, Kz = + 285936, 
K,; =— 24606, K,=+ 66076. 
wy’ = 2091462, ys’= 8713060, 
ps’ = 47943086, py’ = 325634 686, 
fe = 4338847, 


ws = 11°570984, 
Wy = 95825 542. 
A. = 1639145, B,= 5090175, 
r = 19-9518], 


e« = 2965197, 
m = 617275, 
mg = 173384535, 
b = 3493440, 
a, = 1-201 223, 
dg = 33°73318, 
Yo = 427-027, 

d = 1680527, 


AO (Fig. 1) =°740 288. 


The equation, referred to the mode, is 


7-027 ( a  \oums POO ia 
y = 427-02 (1 + 7301 33) (1 a“ ss73ai8) 


The frequencies are calculated as before, with the following results : 














n observed n theoretical Contribution to x? 
212 212°5 “00 
427 419°2 15 
344 363°7 1-07 | 
270 267°9 “02 
210 182-0 4°31 
101 117°1 2°21 
68 72°1 23 
43 42°8 *00 
26 24:6 08 | 
15 13°6 14 
7 73 
3 3°8 
2>14 19} 14°3 “01 
1 0-9 
1 04 
' x2=8'22, n’=11 
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This gives P=-61. For the original seven frequencies P is still -36. Hence 
the result of our successive fittings has been to improve very slightly the fit at 
the start of the curve, but without any sensible increase of P. The conclusion we 
may draw is that if the start of the frequency obtained by fitting with four moments 
does not differ greatly from the value selected from the tables without interpola- 
tion, it is not worth while to make a second fitting. If this is not the case, then 
successive fittings steadily improve the goodness of fit; thus the above curve is 
much better than the one which would be obtained by using the value ¥="5 
obtained from the ng test. 


The curve with \ = "7403 is shown in Fig. 8. 


(\ | WHOOPING COUGH 1912. 


x 06173 x 17-3345 
y-4 2708 (H+ro5m5) (--s375m) 
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Example 5. Dwelling houses in England and Wales first assessed for Income 
Tax in 1929—30. 


Valuation No. of Houses 
Under £20 40,272 
20—40 104,243 
40—60 27,589 
Over 60 12,226 


The information here is meagre; no indication is given of the minimum 
valuation taken, and all houses valued at more than £60 are grouped together. 
The first point, namely the start of the frequency, must be discovered by our 
present method. The second difficulty can be met by a hypothetical spreading out 
of the frequency 12,226 into a suitable “tail.” 


y 


27,539 12,226 











1 > ~ 
Oo 1 2 oN 
Fig. 9. 
Let us assume the form y= we for the tail, the origin being as in the 
figure. 
Then [-yde= m1 —e“/*), and we have 
0 


1 
yde = 27,589 = N’ (1—e~*), 
J0 


[" yao = 39,815 =’, 
0 


whence 


| eet 


= 1180 679, 


and [. y da = 39,815 (1 — e102), 
0 
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Putting «= 2, 3, 4, ete. we get the corresponding frequencies which are shown 
in the following distribution : 


Valuation No. of Houses 
Under £20 40,272 
20— 40 104,243 
40— 60 27,589 
60— 80 8,472 ) 
80—100 2,602 
100—120 799 
120—140 245 
140—160 75 } 12,226 
160—180 23 
180—200 7 
200—220 2 
220—240 1 





Total 184,330 
(i) The vg test. From Table II, 
when A='l, ng= 868,753; when AX="2, ng = 247. 
Hence this test gives \ = "2. 


(ii) The e test. 
Using 3 frequencies, \< ‘05, q<°4, 


» 4 = rX='05, q ='5, 
» 5 » A='1, q=7, 
se i A='1, q=%, 
aS | * rA='1, G=" 


The values of ¢ indicate an asymptotic curve, therefore the cy =0 test was not tried. 


We find that the value X=‘2 gives the best results. The e test is remarkably 
constant for 5, 6 and 7 frequencies, but this is due to the fact that all the frequencies 
except the first two are already smoothed. 


With 7 ='2 we have 


Ky =— 18,898°41, Kg = + 4,045°02, 
K,=— 1,092°24, Ky,=+4+1,550°88. 
With the usual notation, 
(N — ny) ry'" = 129,484, (N — mm) vs’ = 501,121, 
(N — ny) ve’ = 201,810°'5, (N — ny) v4’”” = 1,691,053°625. 


Applying the full corrections we get the following moment-coefficients about 
the start of the second frequency : 
fy = 599 9327, ps = 2°564 9380, 
Ha’ = 1023 7803, pa’ = 8657 8470, 
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leading to 
a= °663 8611, 
pg = 171541947, 
pa = 4°324 9300, 
By = 4553 304, 82=9°813 515. 


Now 282—381—6 =— ‘032 882, which is sufficiently near to zero to allow a 
Type III curve to be taken. 


If the equation referred to the start of the curve is 
(2)’ ~(p+1)= 
Y = Yo = e a, 


we have p= 25 =— ‘121 5170, 
1 


a=2 Jes = ‘763 6693, 
By 


cut ore 
oa T(p+l) 
Start of curve to mean = a =°763 6693. 


= 198,236°8. 


Start of second frequency to mean =’ = ‘599 9327, whence base of first 
frequency =‘163 7366, and the curve starts at *836 2634 x £20, or about £16°7 
per annum. 


To get the theoretical frequencies, we have 





ae se = _W 
[yao N Ppt) NI(u, p) 


in the notation of the Tables of the Incomplete T-Function, where 
u=——— = Vp + 1~ = 937 2742=. 
Vp+1 a a 


The tables mentioned give the following theoretical frequencies which are 
piaced beside the observed frequencies for comparison. 








| 
| Observed Theoretical 
| | 
40,272 40,858 | 
104,243 102,972 
27,589 28,352 | 
| 8,472 8,441 
2,602 2,566 
799 788 
245 243°5 
75 } 12,226 75°5 } 12,147°5 
23 23°5 
7 7 
2 2 
1/ 1 ? 
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These results are in fairly good accord but the probability P is zero to five 
places of decimals. This is explained by the fact that the given frequencies are so 
large that they would have to be reproduced with extreme fidelity in order to give 
a reasonable value of P. 

It may be remarked that an attempt to fit a curve by four moments, assuming 
the base of the first frequency to be the same as the others and using the ordinary 
abruptness corrections, fails owing to the appropriate Type I curve omitting the 
first frequency. 


The curve obtained above is shown in Fig. 10. 


VALUATION OF 
DWELLING HOUSES. 
1929 ) 


SSS ay. - -- 


x Vi og7gste 
Y=198237 (Bsr) ahi 


PTOTE 





i) eke! as Siedler a eieticihatictebeted atin: 


25,000 | HOUSES. 






































Pr 7 
} £20 £40 260 £80 £100 £120 £140 
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Example 6. Cricket Scores, 1931. In order to make the scores as homogeneous 
as possible, only First Class County Championship matches have been considered 


and all “not outs” and uncompleted innings (“declared” or abandoned) have been 
excluded. The distribution is as follows: 

















Score (Runs) — = Score (Runs) ron nod 
0 701 81— 90 35 
1, 2 627 91—100 22 
3, 4 470 101—110 21 
5, 6 374 > 2109 111—120 15 
7,8 335 121—130 10 
9, 10 303 131—140 8 
11—20 895 141-150 3 
21—30 524 151—160 6 
31—40 330 161—170 1 
41—50 170 171—180 0 
51—60 105 181—190 1 
61—70 83 191—200 0 
71—80 56 201—210 1 
_ Total 5096 

















(i) The ng test gives, for X= 8, ng= 419; 
r= "9, vg = 284. 


Hence, since mg = 303, X = °9 is the nearest value. 


(ii) The e test gives the following results: 
Using 3 frequencies, X\="5 (q =°6), 
<= - rA="7 (q='S), 
4 rA>1 (q>1). 
Now we can see from the data that \ should not be greater than ‘5, hence we 


take the value ‘5 given by three frequencies in the e test*. The value of g indicates 
that the distribution is asymptotic. 


* This choice is supported by results obtained if we do not use the subdivision of the group 
1—10 runs, i.e., if we take as the first seven frequencies 701, 2109, 895, 524, 330, 170, 105. With these 
the e test gives: 

Using 3 frequencies, \=-05; 


” 4 ” A='1; 
reich ne A=" 
io i A=]; 
re ~ A="2. 


Hence, since our sub-ranges are now five times as great, the value \=*1 given by 4, 5 and 6 frequencies 
corresponds to the value \=*5 taken above. The e test here gives more constant results as the frequencies 
are more regular than those in the subdivisions. 
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Table VI cannot be used in this example, as ho is not equal to h. We get, from 
Table IV, taking h =1, ho = ‘2, X=°5, 


ima” =— 195362, .°. mya” = — 39-072. 


jams” =+ 68727, .*. mya” =+ 2749. 
jamns” =— 26°624, .°. mpg? =— 0-213. 


i Ny Ma =+ 10884, .. ny pa” =+ 0017. 


Table V gives 
M=— 977525+, «. a= 5a, = — 4,887°63. 


dg=+1,266°217, .°. as’ = 25ag=+31,655"4. 
= — 2,260°555, .. dg = 125a3 = — 282,569. 
dg=+2,510°921, .°. ag = 625a,=+4+ 1,569,326. 
ds =— 1,253°460, —.*.. as’ = 3125a; = — 3,917,063. 
The abruptness functions are 
ays is ae : 
Ae" a,’ B. : 
- 120 + 3024 = + 255°162. 
~ MW _ W __ ao ggs 
om on * > oO. 
TR. = — 838°577. 


126 1440 


Now to calculate the moments of the remainder about its start, when the first 
frequency is omitted, we must group together the next five frequencies with a 
total 2109 in order to have all the ranges equal, as required by the theory. We get, 
with h=1, 


(N—n) vy" = 8,488°5, (N —ny) v9" = 282,622°125, 
(N — ny) va’” = 38,07475, (N — ny) v4’ = 2,801,9546875, 
whence, using Equations II on p. 15, 
pa’ = 1629 719, ps’ = 55037 157, 
pa’ = 7°450 238, pa’ = 545°958 976. 


Transferring to the mean, we obtain 
a= 4°794 254, 
vs= 27°268 791, 
fa = 284742 098, 
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whence By = 6-747 900, Be= 12°388 240. 


These values indicate a Type I curve. When we endeavour, however, to fit a 
curve by four moments, we find that the curve just omits the first frequency. We 
therefore fix the start at the point given by \="5 and fit with three moments. 
We have: 

c=distance of mean from start = -1 + 1-629 719 


= 1-729 719, 


ie 
px 2lMa + use — Ha") _ _ G4 999 415, 


2us” — psc 


The negative value of r indicates that the type has changed to Type VI as a result 
of fixing the start. The working for a Type VI curve with fixed start is as follows: 


y= = = 1602 39628, 
ys = =—- = 1644 14188. 
2 ec 


(a— a)% 


If the equation is ¥=Yo an 


1 — 372+ 4¥3 _ 
Y2— Ys 


— 66°339 415, 


then qgza—Uuz= 


ys(i + y2)(y¥2— 1 — 24s) 
(2y2 —Y¥st+ 2 ys) (y2 — Ys) 





q2 o a= = 65°619 867. 


n= 65-979 641, 


q2 =_— 359 774, 
a = ——~ _=1738278 
dz l 
N is (q1) 


= — — = 1-228 559 x 10™, 
ear lN(i—r) [(q2+1) 
Hence the equation is 


2.097 )\—"359 74 
wed 3 (t— 173 8278) 
y = 1-228 559 x 10% —— 7 85°979 64 


The first seven frequencies can be calculated from the formula 


g +— 


SE r(r+1) 5 
get+1 q2t2- 2!(q2+3)~ PP y 


zx 
yda = yo a’ 24241 | 
~a 
where z=1-a/jz, yoa’—* = 52,640°23. 
The remaining frequencies can be calculated from the formula 


h 


My = 60 [5178y, + 308 (y,1 + Yrsa) — 17 (Yr + Yrs2)]- 
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50 Moment Corrections with unknown Curve-Start 


The results are as follows: 


























iam Observed Theoretical ; 
Frequency Frequency x 
0 701 682°6 0°496 
1, 2 627 657°1 1°379 
3, 4 470 466°0 0-034 
5, 6 374 372°1 0-010 
7, 8 335 310°7 1°901 
9, 10 303 265°7 5°236 
1l— 20 895 913-0 0°355 
21— 30 524 522°6 07004 
31— 40 330 319°3 0°359 
41—- 50 170 201°5 | 4924 
51— 60 105 129°7 4°704 
61— 70 83 84°7 0034 | 
71— 80 56 55:9 0-000 
81— 90 35 37°2 0°130 
91—100 22 24°9 0°338 
101—110 21 16°8 1-050 
111—120 | 15 11-4 1°137 
121—130 10 7°7 
131—140 8 53 
141—150 | 3 3°6 
151—160 | 6 2°5 | 
st . 4 - 30 ¥i > 24°8 1-090 | 
181—190 | 08 
| 191--200 0 06 | 
201—210 1 0-4 
over 210 0 1°0 | 
| 
Totals | 5096 5096 23°181 
| | 








Hence, since n’ = 18, P = +144 from Palin Elderton’s Table. 

The curve has been drawn in Figs. 11(a) and 11(b). Owing to the large 
frequencies on small ranges at the start, the distribution cannot be adequately 
represented in a single diagram. The first figure therefore gives the first six groups 
and the second figure gives the remaining groups on a reduced scale. The points A 
in the two diagrams correspond. It is seen that the observed frequencies are most 


irregular and therefore the curve fits worst in the neighbourhood of 10 and of 


50 runs. This can probably be explained on psychological grounds. 


5. Conclusion. To the question “Where does a given frequency distribution start ?” 
we can give only a qualified answer. Three tests for \ are suggested in this paper 
and they may give conflicting results. If, however, the distribution be a theoretical 
one, i.e., if there are no sampling or experimental errors, then the e test should 
always be used, as it can be relied on to give the start correctly. In other cases 
the choice of the best value may be guided from some a priori knowledge of the 
distribution or it may have to be made by fitting curves for each value of \ and 
selecting that which fits best. In all the examples we have chosen, at least one 
of the tests has given a suitable value of \. When the best value of \ has been 
found, the tables at the end of this paper provide adequate corrections for the 
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CRICKET SCORES 19351. 
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RUNS 
Fig. 11(b) 
moments. The best results will usually be obtained by fitting with four moments, 
but if this fails, the start of the curve should be fixed at the point given by the 
chosen value of X. 
I have to thank Professor Karl Pearson, F.R.S., under whose guidance the work 
was carried out, for his continual help and stimulating advice. 
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Moment Corrections with unknown Curve-Start 


TABLE LI. 


(Exponential Test.) 





3 frequencies 


4 frequencies 





























3b | ah 

r olns 3Ly r aA sly 

05 4°550 518 1°322 219 “05 8°741 460 1°031 6557 

‘1 3°708 312 1041393 | | <1 6°809 970 *760 566 

2 2°956 036 ‘778151 | | 2 5*122 760 514910 

3 2°570 064 636822 | | 3 4°275 215- *389 038 

4 | 2324251 544068 | | °4 3743076 | °3099857 | 

‘> ~=6| ~—« 2150 660 ‘477 121 | 5 3°371 237 “255 273 

‘6 | 2°020 213 425 969 | 6 3°094 107 ‘215 115+ | 

7 1°917 981 *385 351 7 2°878 355+ | +184436 | 

8 1°835 378 *352 183 8 2°704980 | °160297 | 

“9 1°767 062 “324 511 9 2°562247 | 140867 | 
1:0 1°709 511 *301 030 1-0 2°442 475— "124939 | 
1°5 1518 181 *221 849 15 2°047 415+ 075 721 | 

| i 








5 frequencies 






































| 3°674 188 


“016 153 | 





| | | 
| | * iL 
ny | pe sly r | ‘L. Py 
| 
| *05 | 13°312 918 “913 636 | 05 | 18150433 | *845 008 
ae 10-144 490 648 882 | ‘] 13°640942 | 584918 
3 7°414 998 414396 | 2) 9°794426 | -358510 
‘3. | ~~ 6:061 800 *298 039 3 | 7°905153 | -248872 
) £ | 5219689 | -227169 | -4 | 6736800 183648 | 
* | 4°635 132 179552 | 5 | 5°929568 "140 815- | 
| | 4201718 | 145591 | 6 5°333 254 110941 | 
UR A ae ce i oe 4°872 341 *089 224 
8 3596643 | 101037 | | ‘8 4504169 | -0729457 | 
9 3°375 784. | 085889 | | -9 | 4202601 | -060446 
10 3190 921 073 786 10 6| «639950642 | -050662 
15 =| = =2°584325- | 038737 | | 15 | 39126988 | -023748 | 
[ | ye | ej | Xe | 
) ae ee 
| 7 frequencies 
. = wee, 
| L | | 
N | a ain 
05 | 23190383 | +798 453 
‘1 | 17259963 | 542.038 
2 12°239659 | :321906 
3 9°791384 | -+217390 
*4 8284568 | °156 387 
5 | 7247196 | -117067 | 
6 6°483 021 090139 | 
7 5°893711 | 070912 | 
8 5°423 872 | 056 7507 
‘9 5039654 |  -046.063 
1°0 4719 088 | “037 839 | 
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TABLE IL 


Ng. 


53 






























































i 
r | +n — Ny | +g —N, +N; 
) 11°416 667 13°583 333 11°416 667 5*250 000 
ook 40°871 795* | 10°128 205- | 13°053 614 11°232 101 5°219 512 
Ve 16°910 173 9-089 827 12-576 840 11°059 524 5*190 476 | 
5, eo 9°427 714 8-238 952 12°145 663 10°897 815* | 5-162 791 
“4 5-968 296 7°531 704 | 11°754011 10°745 989 5°136 364 
a) 4-063 492 6°936 508 | 11°396825* | 10°603175- | 5°111111 
“6 2-903 196 6°430 137 | 11-069 863 10°468 599 5°086 957 
a f 2°147 687 5°995 170 10-769 535+ | 10°341 576 5°063 830 
| -8 1°631 683 5°618 317 10°492 794 10°221 491 | 5°041 667 
wes: 1-266 271 5°289 285* | 10°237 031 10°107 797 | 5-020 408 
1-0 | 1-000 000 5-000 000 10-000 000 10-000 000 | 5-000 000 
TABLE TIL «gq. 
| } ] | | 
ze —n, +My — Ns +n, | — Ns 
ian 3 Reo, epee le Ste. | 
a | | 
tae x 0 0 0 0 
| Ll | 11°951 764 469 467 | +281 610 530 533 “122 798 560376 | 043 641 915 814 007 277 439 024 | 
“2 6°838 474025974 | 553525974026 | +267 807 359 367 “098 738 095 238 | -016 761 904 762 
3. | 5°072935 153825 | 819856512842 | +435 412 717 928 | -166 119 890 768 | *028 683 139 535 
“4 4°152 342 755 284 1-083 657 244716 | -626057040998 | +246 609 625 668 | -043 272 727 273 
5 3°57 4 603 174 603 1°347 271 825 397 “840 228 174603 | +341 021 825 397 | “060 763 888 889 
| oo 3°171 451 133 408 | 1-612 548 866 592 | 1°078451 133408 | -450 164251 208 | -081 391 304 348 
| :7 | 2-870213 320777 | 1-880.983 107 795 | 1-341 281 598087 | -574.838 772283 | -105 390957 447 
8 2°634 189 640 769 | 2°153 810 359 231 | 1°629300 751880 | +715 842 105 263 | *133 000 000 000 
‘9 | 2°442 746 375 855 | 27432 073 068 589 | 1°943 111141937 | +873 966 444270 | -164 456 632 653 | 
1°0 2°283 333 333 333 | 2°716 666 666 667 | 2°283 333 333 333 1-050 000 000 000 | -200 000 000 000 | 
| | | 
Ce. 
i 
r +n, — Ng | +g — My +N; 
0 1 2°881 944 444 444 | 1°118 055 555 556 “381 944 444 444 | +062 500 000 000 
“1 =| 20°813 961 805 307 | 3-062 704 861 360 | 1°403 052 714 398 “506 947 285 602 | -085 203 252 033 
2 10°354 662 698 413 | 3-235 337 301 587 | 1-696 884 920 635 “643 115 079 365 | +110 634 920 635 
3 6°848 448 708 914 | 3°402 662 402 197 | 1-999 645 290 110 *790 354 709 890 | +133 837 209 302 
“4 5°084 988 540 871 | 3-566 678 125 796 | 2°311 417 112 299 *948 582 887 701 | 169 848 484 848 
dD 4-021 164 021 164 | 3°728 835 978 836 | 2-632 275 132 275 | 1°117 724 867 725 | +203 703 703 704 
6 3°308 675 523 349 | 3°890 213 365 539 | 2°962 286 634461 | 1°297 713 365 539 | -240 434 782 609 
7 2°797 898 918 049 | 4°051 624 891 475 | 3-301 512 363 427 | 1-488 487 636 573 | -280 070 921 986 
8 2°413 803 606 238 | 4°213 696 393 762 | 3°650 007 369 942 | 1-689 992 690058 | -322 638 888 889 
9 2-114 566 024 340 | 4°376 915 457 141 | 4°007 821 384.964 | 1-902 178 615 036 | -368 163 265 306 
1°0 1°875 000 000 000 | 4-541 666 666 667 | 4°375 000 000 000 














2°125 000 000 000 





*416 666 666 667 


















Moment Corrections with unknown Curve-Start 


TABLE III (continued). 


C3. 









oO 


SCEOMGGRENWH 


-_ 
. 

















3°763 229 243 512 
2°616 103 259 486 
1°947 089 947 090 
1°514 500 495 479 
1°215 232 754 657 
“998 046 331 106 
*834 683 230 328 
*708 333 333 333 











o 


DO 1HHAREL Dw 





_ 
.. 2 
oes 
CS 





aes 881 867 599 


344 576 719 577 
*316 055 060 077 
*293 100 578 676 
*274 870 335 561 
"260 687 140 043 

250 000 000 000 





Mo ae ch ehen eben, 


+n, 


oe 
3°576 282 097 132 
1°550 099 206 349 
*903 489 275 582 
*596 829 640 947 
*423 280 423 280 
*314 512 882 448 
*241 614 759 762 
*190 363 060 429 
*153 007 671 805 
*125 000 000 000 


a) 
"340 598 294 965 
*140 918 109 668 
"078 564 284 833 
‘O49 735 803 412 
‘033 862 433 862 
“024 193 298 650 


‘O17 897 389 612 | 


“013 597 361 459 
‘010 552 253 228 
‘008 333 333 333 


*826 388 888 889 
*757 051 236 202 
“699 900 793 651 
°652 066 279 973 
‘611 503 692 386 


*576 719 576 720 | 


"546 598 228 663 
*520 290 002 142 
"497 136 939 571 
*476 621 957 825 
*458 333 333 333 


| “O86 805 555 556 


‘O76 068 371 702 
‘067 415 223 665 
‘060 324 604 056 
*054 430 863 254 
*049 470 899 471 
*045 251 145 795 
*041 626 
“038 485 971 874 


“035 744 043 068 | 


| ‘033 333 333 333 


379 730 073 848 | 


419912 | 





| 2°018 143 773 427 
2°115 508 021 390 
2°210 978 835 979 
2°304 778 273 257 
2°397 095 499 755 
2°488 092 627 402 
2°577 909 351 185 
2°666 666 666 667 





*888 377 965 704 

“963 658 645 276 
1-039 021 164 021 
1°114 452 495 974 
1°189 941 537 282 
1:265 478 801 170 
1°341 056 166 056 
1°416 666 666 667 





—N, +My | — Ny +My —N; 
ae ae RE 
a 2°621 527 777 778 | 1°711 805 555 556 *663 194 444 444 | +114 583 333 33. 
13°317 393 333 129 | 2°540 940 000 204 —. 635 757 371 *738 126 147 391 | +130 691 056 911 | 
6°115 845 959 596 3°475 820 707 O71 | 1°918 623 737 374 °813 194 444 444 | °147 222 222 222 | 





“164 147 286 822 | 
*181 439 393 939 
-199 074 074 074 
*217 028 985 507 
*235 283 687 943 
253 819 444 444 
272 619 047 619 
291 666 666 667 








C4. 


“673 611111111 | 


*667 191 188 041 
*661 210 317 460 


"655 626 027 719 | 
-650 401 069 519 | 
"645 502 645 503 | 


*640 901 771 337 
*636 572 742 956 
*632 492 690 058 
*628 641 200 070 
*625 000 000 000 


“079 861 111 111 


‘075 446 779 813 | 
*071 473 665 224 | 
‘067 880 524 149 | 
‘064 616 755 793 | 
061 640 211 640 | 


"058 915 520 872 
“O56 412 795 774 


*054 106 620718 | 
*051 975 255 177 | 


*326 388 888 889 
| *332 808 811 959 
*338 789 682 540 
*344 373 972 281 
*349 598 930 481 
*354 497 354 497 
*359 098 228 663 
*363 427 257 044 


-367 507 309 942 | 


*371 358 799 930 


*375 000 000 000 


*0F;2 500 000 000 | 
-065 040 650 407 | 
-067 460 317 460 
069 767 441 860 
‘071 969 696 970 
074 074 074 074 
-076 086 956 522 
-078 O14 184 397 
079 861 111111 
081 632 653 061 
083 333 333 333 | 

| 





+1, 


| +045 138 888 889 
*043 600 839 234 


‘039 549 910 873 
“038 359 788 360 
| 9037 238 325 282 
-036 179 796 818 
*035 179 093 567 
*034 231 641 375 


“042 162 698 413 | 
040 815 128 024 | 


| 
| 
| 
| 
1 


*050 000 000 000 033 333 333 333 | 


ceil 


‘O10 416 666 667 
“010 162 601 626 | 
“009 920 634 921 | 
-009 689 922 481 
-009 469 696 970 
-009 259 259 259 
009 057 971 014 
-008 865 248 227 
“008 680 555 556 | 
“008 503 401 361 2 





“008 333 333 333 
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1 ” 
TABLE IV. 7 tifa - 


ho 


55 





























—n, +My — Ns | +1, —n; 
| 
0 0 0 0 0 0 
“1 *053 146 674 “000 449 173 *000 190 403 “000 067 022 *000 011 125 
2 *111 505 682 *003 389 207 -001 557 015* -000 563 833 *000 094 889 
3 “173 837 035 *010 866 342 -005 373 119 -001 998 761 *000 340 875 
“4 "239 262 372 024 618 517 “013 025 673 *004 970 771 000 858 667 
D “307 142 857 *046 199 157 026 023 0657 “010 174 851 *001 779 514 
“6 377 003 205* | -077 054 795— “046 002 205* | °018 407 333 “003 258 000 
"a -448 482 797 “118 575 634 “074735131 | °030571 163 “005 473 708 
8 “521 303 258 “172 129 631 "114135258 | °047 681 123 *008 632 889 
‘9 595 246 405* | +239 086 220 "166 263 359 “079 869 029 *012 970 125 
1-0 *670 138 889 *320 833 333 *233 333 333 | °101388889 | -018 750000 
L 
aay Ai 
he” ~ - 
] 
AY + My — Ny +s — Ny +s 
( 0 0 0 0 0 





“003 650 103 
“015 663 867 
*037 282 384 
“069 407 157 
“112 711 010 
*167 707 836 
*234 798 023 
°314 298 9757 
“406 466 153 
“511 507 937 


nh 


0 
“000 278 636 
“002 423 051 
“008 740 137 
*021 876 387 
*044 716 081 
“080 315 486 
*131 858 723 
*202 627 203 
*295 977 878 
*415 327 381 


“000 045 316 
-000 689 594 
-003 342 812 
*010 174 304 
*024 038 742 
“048 444 736 
“O87 549 158 
"146 167 819 
*229 797 043 
"344 642 857 


0 
“000 004 102 
*000 125 549 
*000 917 673 
“003 742 508 
“011 104.929 
*026 975 571 
*057 118 020 
*109 429 483 
*194 300 820 
*325 000 0CO 


“000 019 321 
*000 320 269 
‘001 678 446 
-005 487 556 
*013 849 817 
*029 670 121 
“056 754 O81 
“099 910 037 
°165 055 118 
*259 325 397 


“000 006 814 
“000 116 407 
“000 627 664 
-002 108 127 
“005 458 219 
*011 979 901 
“023 449 694 
*042 196 O86 
“O71 181 282 
“114 087 302 





0 
*000 001 756 
“000 058 724 
000 465 341 
-002 043 440 
006 490 247 
*016 788 729 
“037 682 934 
076 221 175+ 
142 370 050* 
*249 702 381 


+My; 


0 
“000 000 620 
-000 021 396 
*000 174 614 
*000 788 408 
“002 570 858 
“006 818 053 


‘015 669 5657 | 


“032 414 776 
“061 853 498 
“110 714 286 


| 


-000 001 132 
“000 019 625*> 
-000 107 316 
-000 365 3457 
“000 958 306 
“002 129 814 
*004 219 488 
“007 681 354 
“O13 103 669 
*021 230 159 


0 

~000 000 103 
“000 003 611 
“000 029 904 
-000 136 921 
“000 452 500- | 
“001 215 604 
*002 828 545+ 
*005 921 321 
“011 429 177 
*020 684 524 





TABLE IV (continued). 


Moment Corrections with unknown Curve-Start 








~ 


me oO | 


SOTA HE 


“000 022 553 
-000 395 663 
“002 155 292 
“007 232 436 
018 564 080 
“040 168 896 
"077 200 853 
*135 988 366 


-224 063 6457 | 
“350 185 185* | 


“000 000 366 
“000 022 502 
“000 247 619 
“001 351 176 
“005 028 172 
“014 703 127 
“036 429 862 
-079 992 630 
“160 224 084 
*298 558 201 





TABLE V. 








a” 
he Moa - 
| +g —M, 
| 0 0 
| -000000157 | -000000 056 
| -000010577 | -000003 860 
-000 126 434 000 047 558 
-000 744 069 000 287 950 
| *002 967 929 001 179 826 
| +009 252 508 “003 772 720 
| -024 325 338 "010 160 223 
| +056 439 374 024 117 893 
; *119005 540 “051 968 585+ 


*232 658 730 


ay. 


| 
| 


*103 716 931 | 


0 
-000 000 009 
“000 000 652 
*000 008 154 
“000 050 082 
“000 208 029 
“000 674 000 
“001 838 150— 
-004 416 456 
“009 627 849 
*O19 431 217 | 








SHaHIAAK ODEO 


8°174 359 079 
3°382 034 632 
1°885 542 836 
1°193 659 282 
*812 698 413 
*580 639 168 
*429 537 351 
+326 336 675* 


] 


0 
*257 692 412 
"465 367 965+ 
*635 542 836 
*776 992 615+ 
“896 031 746 
*997 305 834 


1-084 299 255* 


1°159 670 008 


*105 943 951 
*201 298 701 
"287 534 087 
“365 864 528 
*437 301 587 
“502 694 166 
“562 759 568 


“618 107 769 


-036 913 192 
‘O71 428 571 
“103 770 261 
*134 135 472 
*162 698 413 
*189 613 527 
*215 018 210 
*239 035 088 


*006 097 561 
“O11 $904 762 
“O17 441 860 
*022 727 273 
‘027 777 778 


“032 608 696 


"037 234 043 


-041 666 667 




















| °9 °253 254 O77 1°225 476 300 “669 260 542 *261 773 940 045 918 367 | 
| Id | *200 000 000 1°283 333 333 “716 666 667 | *283 333 333 | “050 000 000 | 
= = s mee! os { | | 
dg. 
ies | | 
r +My — Ny | +N | —nN, +N; 
eae 5 ones [ssa _| 
0 oo 5°763 888 889 | + 2°236 111 111 —*763 888 889 | +°125 900000 
i 34°059 829496 | 4°690170504 |+1°794677 981 | —-610 083923 | + °099 593 496 
2 14°091 810 967 3°824 855 700 \+ 1°397 366 522 — *466 269 841 +075 396 8257 | 
3 7°856 428 483 3°115 793 739 = |+1°038 052 415~ | —°331 512 802 + °052 325 581 
“4 4°973 580341 | 2°526419659 (4+ °+711 675579 — *204 $91 087 Be *030 303 030 | 
5 3°386 243 386 2-030 423 280 |+ +414021 164 — °085 978 836 | +°009 259 259 
5% 2°419 329 865— 1°608 447 913 (4+ *141 552 O87 +°026 167 472 | —-010869 5657 | 
| 1°789 738 961 1°245 975 325- |-— +108 720423 +°132 020 318 | — *030 141 844 
8 1°359 736 146 “931 9380 521 — 339 337928 | +°232 090 643 os 048 611 111 
9 1°055 225 323 °657 737 640 |— °552 474 482 + *326 835 863 — 066 326 531 
1°0 *833 333 333 *416 666 667 | — *750 000 000 +416 666 667 — *083 333 333 



































TABLE V (continued). 
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dg. 

nN —n +N, —Ns +N, — 

0 oa) 15°729 166.667 10°270 833 333 3°979 166 667 *687 500 000 
3 71°525 641 943 | 13°474 358 057 9°343 823 761 3°656 176 239 *634 146 341 
2 29°592 803030 | 11°657 196 970 | 8*509 469 697 3°354 166 667 *583 333 333 
3 16°498 499 815— | 10°168 166852 | 7°754910071 3°071 176 885* *534 883 721 
°4 10°444 518 717 8°930 481 283 7°069 518 717 2°805 481 283 “488 636 364 
1) 7111111 111 7°888 888 889 6°444 444 444 2°555 555 556 “444 444 444 
6 5°080 592 716 7°002 740 617 5°872 259 383 2°320 048 309 “402 173 913 
“7 

8 


























4°295 373 507 | 
3°263 366 7507 | 


8*990 340 779 
8*236 633 2507 | 





| 10°539 070 986 


9°985 588 972 





5°683 151 236 
5°444 982 456 








1°127 659 574 
1083 333 333 


3°758 451 819 6°241 548 181 5°346 687 113 2°097 757 332 *361 702 128 
2°855 445 906 5°582 054 094 4°862 390351 | 1°887 609649 *322 916 667 
“9 2°215 973 178 5°006 249044 | 4°414 803 587 1°688 644 689 *285 714 286 
10 1°750 000 000 4°500 000 000 | 4°000 000 000 | 1°500 000 000 *250 000 000 
a4. 
r +n, | — Ny | + Mg —M, + Nz 
coe ae tes 
0 a) 19°833 333 333 16-166 666 667 7°833 333 333 1°500 000 000 
| 81°743 590 792 17°256 409 208 15°107 227 155* 7°464 201 416 1°439 024 390 
*2 33°820 346 320 15°179 653 680 | 14°153 679 654 7'119 047 619 1°380 952 381 
3 18°855 428 360 13°477 904 973 13°291 325 796 6°795 6380 726 | 1°325 581 395+ 
“4 11°936 592 819 | 12°063 407 181 12-508 021 390 6°491 978610 | 1°272 727 273 
*5 8-126 984127 | 10°873015 873 11°793 650 794 6°206 349 206 | 1°222 222 222 
6 5°806 391676 | 9°860274991 | 11°139 725 009 5°937 198 068 1°173 913 043 
| 
8 
*S 
0) 




















2°532 540775- | 7°578 570336 9-474 061 243 5°215 593 930 1-040 816 327 
1 | 2-000 000 000 7°000 000000 | 9000000 000 5°000 000 000 1-000 000 000 
s- 
r —n + Ny — Ng +N — Ns; 
0 © 10°416 666 667 9°583 333 333 5416 666 667 1+250 000 000 
*1 =| 40°871 795 396 9°128 204 604 9°053 613 578 5+232 100 708 1°219 512 195* 
‘2 | 16°910 173 160 8089 826 840 8°576 839 827 | 5°059 523 810 1-190 476 190 
3. | 9:427 714180 7°238 952 487 8°145 662 898 | 4°897 815 363 1°162 790 698 
"4 | 5°968 296 409 6°531 703 591 7°754.010 695* | 4°745989305- | 1°136 363 636 
*D 4-063 492 063 5°936 507 937 7°396 825397 | 4°603 174 603 1-111 111111 
6 2°903 195 838 5°430 137 495* | 7:069862505- | 4°468 599 034 1°086 956 522 
7 2°147 686 753 4-995 170 389 6°769 535 493 | 4°341575 618 1°063 829 787 
8 1°631 683 375+ | 4°618316625- | 6°492794486 | 4°221 491 228 1°041 666 667 
_ 1°266 270 387 4°289 285 168 6°237 030621 | 4°107 796 965- 1°020 408 163 
1°0 1°000 000 000 4-000 000 000 6-000 000 000 4°000 000 000 1:000 000 000 











































































































58 Moment Corrections with unknown Curve-Start 
TABLE VI. &;,. 
r Ny Ny Ns Ny Ns 
0 —-@ — *021 501 598 +°013 948 137 — °005 347 498 + °000 913 525+ 
+} — 636 353 675+ | — -039 437 722 + °021 316 401 — °007 914 081 + °001 337 436 
2 —°352 799 985~ | —-051 314 487 + °026 752 959 — :009 879 800 + °001 667 992 
3 — *308 362 782 — °055 978 631 + °029 089 396 — °010 752 319 + °001 817 055+ 
“4 —°324 425 067 —°052 318 318 + °027 025 398 — 009 946 742 +°001 676 356 
5 — °365 125 556 — °039 230 633 +°019 124 747 — °006 780 511 +°001 115 842 
6 — *418 429 429 — °015 600 485+ | +-°003 811 099 — -000 468 312 — -000 017 978 
‘7 — +479 128 525* | +°019713730 | —-020636 406 + °009 882 997 — °001 903 687 
8 —°544 586042 | +:°067 890 333 — °056 087 666 +°025 279 452 - 004 746618 | 
9 — 613 315045 | +-°130 151913 — °104 566 227 + °046 845 034 — 008 780513 | 
1:0 — ‘684 408 069 +°207 771 164 — °168 253 968 +°075 826 720 — 014269180 | 
Ko. 
r ny Ny Ng Ns ns 
0 —@ +°041 473 765+ | —-013 288 139 +°003 775 353 — °000 545 6357 
*] — ‘253 150 199 +033 332 954 — ‘009 940 553 +°002 608 898 — *000 352 946 
2 —°090 583 9157~ | +°026 164 477 — °006 644 003 +001 414 993 — ‘000 152 018 
3 — °021 952 593 +°018 165 156 — 002 576 711 — °000 112 290 + °000 109 623 
“4 + °031 907 940 + *006 889 972 + °003 693 177 — *002 546 686 +°000 533 695+ 
a9) + °087 179 810 — 010 714 122 + °014 299 657 — °006 794 093 +001 285 319 
6 +°149 466 856(5)| — -038 301 676 +°032 174 292 — °014 161 322 + °002 608 592 
fs +°221 303 959 — ‘080 139 026 +°061 145227 | —-026 429 212 + °004 843 573 
8 +°304046996 | —°141125 485+ | +°106 039 965(5)) — °045 930 103 +°008 444 692 
‘9 +°398 510089 | —*226822037 | +°172 792029 | —°075 629 647 +°014 000 575+ | 
1°0 +°505 224 868 & "343 485 4507 +°268 551 587 | —*119 212 963 +°022 255291 | 
| | 
Kg. 
r ny | Ne Ns | ny Ns, 
spe 
| | 
0 +0 | +°030 123 595~- | —-019 380374 | +:°007 330 936 —°001 233 879 
*] + °066 421 864 +°032 230396 | —-020 246 603 +°007 632 757 — *001 283 736 
2 + °025 173 412 +°034046 417 | —-021 081 639 +007 935 169 — °001 334 630 
3 + 006 645 369 + °036 227 120 — *022 191 913 + °008 352 286 — °001 406 104 
°4 —°012 136 459 | +°040 206 147 — ‘024 409 167 + °009 213 852 —*001 556 248 
5 — "038 084 688 ’| +°048 539 894 — *029 438 880 +°011 229 368 — °001 913 038 
6 — °075 577 632 | +°065 236904 | —*040 270947 | +°015 696 183 — 002 715 561 
“7 — ‘128 353 817 +°096 089196 | —*061655270 | +-024 754989 = “004 366 244 | 
8 — °*199 964 386 +°149 015 662(5)) —°100 645137 | +°041 696 171 — 007 495 188 
9 — °293 911 395+ +°234 423961 | —°167211404 | +°071320436 | —°013037737 | 
1-0 — *413 695 437 | + 365 595 238 | — *274 930 556 | + °120 357 143 | — +022 326 389 | 
| 
tx | eee ae erat Pee PS IS eee 
Ky. 
r ny | ng | Ns ny nN; 
— --— | —— - —- 
0 +a —°031 972002 | +-:006 540 062 — 000 622 795+ | — -000 049 603 
q +°213 572 278 — 025 240 324 + °003 752 503 +000 341 482 — *000 208 889 
= +088 749 O81 — *019 837 073 +°O01 271 844 + °001 239 368 — *000 359 955+ 
3 +°051 413 852 —°015 616 484 + °000 865 142 + *002 040 576 —*000 497 106 
*4 + °038 415 996 — °013 024 752 — *002 293 838 + °002 593 455+ | — -000 593 256 
*D + °039 795 288 — °013 591 937 — ‘001 936 224 + 002 447 767 — °000 567 250+ 
6 + °055 337 711 — ‘020 621 173 + °002 640 O17 + °000 558 013 — *000 227 483 
7 + *088 422 233 — 040 075 263 +°016 143 678 —*005 165 810 + °000 815 832 
8 +°144 513 696 — 081 669 020 + °046 811 780 —°018 496 055 | +:°003 278 339 
9 + °230 679 740 —*160 181 328 + °108 041 613 —°*045 752 710 + °008 378 659 
1:0 + °355 410 053 — *297 003 968 +°220 456 349 — *096 937 831 +°018 075 397 
































ON ASYMPTOTIC FORMULAE FOR THE 
HYPERGEOMETRIC SERIES 


Il. HYPERGEOMETRIC SERIES IN WHICH THE FOURTH 
ELEMENT, z, IS NOT NECESSARILY UNITY 


By O. L. DAVIES, Pa.D. 


THE hypergeometric series was first introduced by Euler in 1778 when attempt- 
ing to obtain solutions of the second order differential equation 


d* d 
a (a+ bar) SY + a(c+ ear) 2 + (f+ ga")y =0. 
A particular case of this equation is 
pe Py . ri ( dy = 
a(1—#) 73 +iy—-@+B8+1)a} 7 —aBy=0, 


which has the hypergeometric function 


1 1 


as one of its solutions. 


The chief problem in the study of series is to determine the partial sums and 
in particular, the total sum. In this paper we will obtain approximations to the 
sum of a number of terms of the hypergeometric series by fitting certain systems of 
frequency curves to the series and then expressing, approximately, the required sum 
by means of an integral. 


The hypergeometric series will be looked upon as a discrete frequency distribu- 
tion; we restrict our attention, therefore, to series with positive terms only. 


In § 1 the system of frequency curves which arises from the differential equation 


ldy_ _Yrii-Y 
ydz 


’ 


c 
5 (Yrs t Yr) 


where y, is the rth term of the series, is discussed. This equation was originally 
employed by Professor Karl Pearson to obtain curves to fit the series F(a, 8, y, 1) 
in which the last element is unity. The curves which arise from its integration are 
known as the Pearson System of Frequency Curves. These will be denoted in the 
subsequent work by P(z). When the last element of F is not unity, the resulting 
system of frequency curves has the more general form e~ P(z). The goodness of 
fit of this new set of curves to the general series is tested in a variety of examples. 
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In § 2 an alternative method of curve fitting is given. 


2 


§§ 3—6 are devoted to the evaluation of the integral | e~°? P (z) dz. 
xy : 
§§7—9 give further and more accurate methods for summing the tail of the 


series, while § 10 is mainly a summary of the results and conclusions arrived at in 
the previous paragraphs. 


$1. On the Fitting of a System of Frequency Curves to the 
Hypergeometric Series F(a, B, y, £). 


Let y, denote the rth term of the hypergeometric series 


8, e(a@+1)B(B+1) » 

lly Qiy(yt1) 

—¢(@t1)...(@+r—2)B(B+1)...B+r—2) 4 
(r—1)!y(y+1)...(y+r—2) he 


1+ 


Form the ratio 
(a+r—1)(8+r—-1) me 








——* § 1 
Yrti— Yr _ r(yt+r—1) 
Yuty, (at+r— 1)(8+r—-1) 
rqy+r—1) “a+1 


_ r(@—1)+r(at+B—2e-y—- 1)+(@-1)(8—-I)« 
r(a+1)+r(a+B—2e+y—-1)4+(a—-1)(B-l)« 
Let each term of the hypergeometric series be represented proportionally by 
a rectangular block of width c and assume the weight of each block to be concentrated 
at the mid-vertical. The ratio 
Ay, 


c 
5 (Yr + Yrtt) 


is equal to the slope of the line ef divided by ab, ab being the ordinate of ef at 
the point a. 





b | 

















Yr a Yr 


<—C—_ 
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We wish to test the goodness of fit of the curves for which +4 is equal to this 


ratio at all points a midway between the mid-verticals of the rectangular blocks 
composing the histogram of the series*. Evidently, the equation of the required 
curve is 
ldy _ Ay, 
aa ei ae ot 
y > (Yr + Yr) 
where rc =z, measured from a suitable origin. 


The distance of the rth mid-ordinate from a point O, distance ¢ before the first 
mid-ordinate, is rc. 





7 





= 
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I Ys 

| | | 

| | 

Yo| | 

Vy 1 | | 

O l 1 1 l 
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The relation (1) is true for all points at distances (r + 4), (r= 0,1, 2, ...) from O. 
Hence the equation of the curve which fits the series is 
1 dy _ ( Ay, 
y dz 


eC) 


= Nisin 


z being measured from the origin O. 

It will simplify matters considerably if the origin be moved to the start of the 
histogram. This necessitates replacing re of (1) by 2. The equation of the required 
curve referred to the new origin is then 

ldy 22*(#—1)+2c (a+B—2e- y—-1)+@-1)(8- 1) xe* 
ydz ¢ 22(#+1)+2c(a+B—2e+y-1)+(@-1)(B—-l) ae 

The mid-verticals of the rectangular blocks are now at the points 


z=(r+4h)c, (r=0, 1, 2, 3, ...). 


»»-(2). 


* The reader must bear in mind that we are not proceeding to a limit; c is finite. 
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The terms of the hypergeometric series will continue to increase as long as 
Yr41/Yr > 1. It is reasonable, therefore, to determine the position of the modes from 
the values of r which satisfy ¥,,:/y,=1, or what is equivalent, 


If one value of r which satisfies (3) is positive, the modal term is the largest 
integer contained in (r+ 1). 


Imagine for the moment the terms of the hypergeometric series to be ordinates 
of a continuous curve. If 79 be a value of r satisfying (3), then the ordinates at ro 
and 79+ 1 are equal. The mode (or antimode) of the curve if bell- (or U-) shaped 
lies somewhere between 79 and 7)+1. When the standard deviation of the curve is 
fairly large, a good approximation to the position of the mode is 7+4. We shall 
take this to be our definition of the modal position of the hypergeometric series. 
The modal term is the nearest integer to r9+ 4. This is measured from a point 
distance ¢ before the first mid-ordinate, hence, referred to start of histogram, the 
distance of the mode is 79. Of course, a more accurate determination of the position 
of the mode may be obtained by fitting a high order parabola 


Y =Ag +124 ag27 +... 
to the mid-ordinates of the series around the mode and solving (dy/dz) =0. 


Let us return now to the fundamental equation (2) which we will proceed to 
solve. It may be put in the form 


84 so ®t B= 2)e—-(y—1) , ,2(@— 1) 8-1) 








1 27 (2) a—1 z—l 
ydz c\et+l/ ,, , @+8—-2)et(y-1), s(@-I(E-)) 
“+i #2+1 
Let saz | (e+ B-2e+(y—V)}?_ 4 @-V(B-1) 
io “a+1 ) x+1 : 
Cc 
~ (a+ 1)* a, 


where, on simplification, 
A =[«?(a— 8)? + 2a (a+ B)(y+1)—2(@8+y)} +(y—-1)*). 
Three different cases arise according as A is positive, negative or zero. These 
will be considered separately. 
I. A positive. 
The denominator of the fundamental differential equation will split up into the 
two real factors 


E + Test +8 - 2)0+(y-1) +vA) | 


x |2 +5 aos {(a+B—2)x+(y-1)- val | = (2+ F,)(z+ Fp). 
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+@siIpt  @+h)@+h) 








Wen ty 3 {(e=4) 2ac ee eR 
ydz c \e+1 


2/1), # (4, 2) 
~ ¢\etl) * (@ +1)? 12@+h 24+F,)’ 
where 2A =—(e+2)— 7 2(e+ I)(@B—7) + (+2) (a+ 8.0741) 


1 Serer s 
2B=—(e+2)+ VA {2(a+1)(@B—y)+(e+2)(@+8.c2+y74+1)} 
and e=y—a-B-1. 
Integrating the original differential equation, we find 
_2 (752) 4Ax 4Bxr 
y= ye e \l+z [4h] ot ear et. 


This is a product of an exponential term into a Pearson Type I or VI curve, the 
former when F, and F2 are of opposite sign and the latter when F, and F; are of the 
same sign, 





Il. A negative. 


The denominator of the differential equation can now be expressed as a sum of 
two squares, namely, 
c . c*D 
oo ~2)0+ yh) eS 
|2+ 2(@ +1) (a+ B—2)x+(y » | + t(e@+1)’ 
where D =|Aj. Then 


1 dy (5°) 4a z(a+8—y-—1)+(@—-1)(8—-l)c 





y dz c\l+a (a +1)?” o eD : 
wat 4(@+1) 
ane ey ¢ Oh of 5. tis 1 
where . R F+eeanlets 2)x+(¥ 1]. 
_ p2(a#+1) 
Let [= R er) , then 
1 dy __VD(Q—~«) _2s(e+3) 3% , 
ydt (l+2)? (l+a? “14+ 14+?’ 


a 8a 1)(8-1) + e+2 (a+8-2)e+ -1)] 
*"oaeni" 2(a+1) ‘ ov SEs 


Hence, on integration, 


where pv 





ji-z | ¢ “ €+2 
“\ ) lax $ — sr. 241) “4 
y=Yyoe \( + x)"} qd +f?) ( ) a” arc tan ¢ 
2(¢+1), 
2/1--27\ vy are tan 7 R 
- z c,/D 
RR =(i32) é ” 
= Yo @ ne 
‘ CD wrt? 
+4@+1)? 


This is a product of an exponential term into a Pearson Type IV curve. 
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Ill. A=0. 
For this case 
ldy 2(@-1), 4 2(@+B-y—-1)+(@—-1)(B-I)e 


ydz c(@+1) @+1? ; 
ydz e(@+1) (@+1) 2+ a@eea (@+e-2 2407-1) 





and we readily find 
4x (e+ 2) 


sini eee) e+ fier) {(a+B—2)a+(y—- »)| wine ds 


4acv’ 


x cea @ +1) ((a+8-a)2+6-} | 
where v’ =((a—1)(B—1) + {@+B—-2)x+(y—1)} (e+ 2)/2 (a+ 1)]. 


This is a product of an exponential term into a Pearson Type V curve. 


IV. Type III Curves. 
When either @ or 8 is unity, the differential equation reduces to the fairly 
simple form 








--7(E* _ 4a(e+2) 1 (i) 
= c\l +2/ (a +1)? F Cc = eat ORES 
e+oyy et8-22+y-1) 
which gives the solution : 
2 /1-2\_ te 22- 
Te heer (x +1)? — 
Y= Yoe ( mt aes (a+B—2a+y-— | eevces (11). 


Since the hypergeometric series is symmetrical with respect to a and 8, we may 
take @= 1. (ii) then reduces to the final form: 
(y- 4) 


at )s . eed PET 
y=ye °'t* [2+ Sy l@-De+(-1)] ; O0<z<eu. 


(a—1)a+(y-1) 


is positive. 
“2+ I 


This is a Type III curve when the quantity ww =c 
When negative, the solution of (i) is 





~3 (5) 4a (y~a) 
y=yoe e\l+2 [y’ — 2] PEEP O<ccy, 
where y’ = ||. This curve is not of Type III because its range is limited. It may 
be looked upon as a product of an exponential term into a Pearson Type VIII or 
IX curve, which we may symbolise as Gy; or Gyx. 

When the range of the series extends beyond +f’, as would be the case were a 
and ¥ positive, fractional and less than unity, the equation of the curve corresponding 
to terms of the series beyond w’ is 


2/1-2 (y-a) 
-- z — 4x A 
y=yo'e (tse) [e-y] & th, pl<zcom. 
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Hence, the complete solution of (i) when y is negative is the following: 


2 /1-2) v-3) 
——— }z -—4z ~-—* 
(5) [w’ — 2] (S+iP O<z<w, 


y=yie ° 
2(1-« ge 
BR Fae gt ae © tc +1)? z>y 
When z= w’, y becomes zero. We cannot, therefore, expect a good fit near this 
point. 


When a and y are negative and fairly large numbers, the terms of the’ series 
beyond the point y’ are small and generally negligible compared with the modal 
terms. For such cases the second part of the solution of the differential equation 
will not be required for computing purposes. 


Sufficient conditions for a Type IIT curve are, therefore, 
(a) a or B or both unity, 
(b) [((@ —1)#+(y—1)]/(x + 1) positive (for 8 = 1). 


A necessary condition is that the numerator and denominator of the fundamental 
differential equation have a factor in common. 


The curves which arise out of the solution of the differential equation 
ldy  az*+2bz+d 
ydz Az*+2Bz+D 
have the form y =e P(z), 
where P(z) isa Pearson type curve. These may, therefore, be divided into three 
main types, namely, 
Gy: y = ye” 2” (a—2z)*, O<z<a, 
Gyi: y = ye? (2 — a 2-4, adgzegon, 
—y arc tan z/qa 
Giy: y = ye? ; —o<eze+o0. 
1V y= (1+ 2/ay’ 


Integrating throughout the range for curves of type Gj, we have 


N=yY [ervear (a—z)dz 
0 


oa 3 (-¥ t P aed —z¥d 
= Yo p' | 2"tt(a— 2) dz 
o ft! 0 


alist) (r4+)) war 

= r+s+1 SF Mit bcs Ae : 
You T(r +s+ 2) li+s+2)"" 

(r+1)(r+2) og 

2 (r+s+2)(r+s+3)? re 
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The higher moments are given by the partial differentials of N with respect to 
p, 1.e. 
. oN 
Nps =(-)* Op* # 
The moments of the curves Gy; and Gy may similarly be expressed by infinite 
series. I have attempted to use these series in a number of numerical examples but 
have found that they converge far too slowly to be of practical use. 


Examples of the Above Method of Curve Fitting. 
I. Series F (30, 30, 111, 75) with c= 1. 
The differential equation of the curve for which d log y/dz is equal to 
Ayr/4(Yriat yr), 7=0, 1, 2, ..., 
y, being the rth term of the series, is the following: 


5 3 4 SS. ee 
~ 7° ¢+4322,090 z+ 83°392,196" 
Its solution is 
Y = yor ** (z + 4°322,090)3 3.457 (z + 83-392,196)- 842,282, 
This curve belongs to type Gy. 
The equation giving the position of the modes of the curve is 
2(%—1)+2 {(a@+B—2)x—(y—-1)}+(a—-1) (8-1) a=0. 


This equation also gives the position of the modes of the series. It has two roots, 
namely, 9°168,914; — 275°16891. The latter is well outside the range of the series 
and must accordingly be rejected. Hence, the true position of the mode is 9°168,914 
(i.e. the tenth term) measured from the start of the histogram. 


The ordinates of the curve corresponding to the mid-points of the rectangular 
blocks representing the terms of the series are obtained by putting 

z=r+} (r=0, 1, 2, 3, ...) 
in y. 

The constant y% may be found by equating the value of one of the mid-ordinates 
of the curve to the corresponding term of the series. The best results will be 
obtained by selecting the mid-ordinate nearest a point of inflexion of the curve 
because here the area under the curve corresponds most close'y to the numerical 
value of the mid-ordinate. This method has the objection of throwing too much 


weight on a single ordinate. ~This objection will shortly be cemoved when other 
methods of curve fitting are discussed (§ 2). 


When we require merely the probability integral of the series (or the curve) 
and not the actual value of the partial sums (or partial areas), the question as to 
the choice of yp does not arise, because a probability integral measures the 
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proportional values of the frequencies and not their absolute values. In most of 
the subsequent examples the constant yo is calculated by equating the whole area 
of the curve (found by quadrature) to the total sum of the series (found mechanic- 
ally). This is done to compare. the Probability Integral of the series with the 
Probability Integral of the generalised Pearson curve which fits it. We do not 
know the actual sum of the general hypergeometric series and no one has yet suc- 
ceeded in finding a simple and accurate expression for it. One method of obtaining 
an approximation to this sum is to calculate yo by equating the ordinate of the 
curve at a selected point to the corresponding term of the series. How good this 
approximation is may be judged from the subsequent examples. 


In this first example, yo is calculated by equating the ordinates at a selected 
point. 



































| l } 
| Terms of | Mid-ordinates | Areas under Terms of Mid-ordinates Areas under 
Series of Curve Curve Series of Curve Curve 
a = 
1-000 1°702 1*866 58°705 59°024 | «= 59°176 
6°081 7°202 7°539 43°152 43°496 43°638 
19°567 20°703 21°166 31°030 31°373 _ 31°496 
44°397 45°067 45°503 21°877 22-198 22°299 
| 79°328 79°595 79°826 15°152 15°438 15°517 
119°717 119°496 119°421 10°327 | 10°572 10°632 
| 158*031 157°618 | 157-252 6937 | 7139 | 7183 
|  187°553 187°124 | 186°573 4599 | 4760 | 4937 | 
203°994 203°639 203°042 3°013 3°138 3°160 } 
206°280 206°015 205°490 1:952 2°047 | 2°062 
196°095 195°916 | 195°535 1°252 1°324 1°334 
176°795 176°702 176°484 “796 *847 “896 
152°250 | 152°250 152°178 “502 | "539 *Db44 
125°971 | 126°068 126°107 314 *340 *343 
100-628 | 100°818 | 100-928 195 | *213 "215 
77°926 78°198 | 78°342 *121 | "134 °135 
| | 
| 
II. Series F (— 30, — 50, 100, °5); c=1. 


The equation of the curve which fits this series is 





121928 tan 38 
e9 V(1379) are wan (13879) 
y=yoe * ce 
1379) 9 
bu 
(R . >) 
where R=2+ 48. 


Rimoae) 77 24°870,2702. 
This curve belongs to Gyy. 
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In this example, and in all subsequent ones, the constant y is determined by 
equating the area of the curve, found by quadrature, to the sum of the series. 































































































Terms of Mid-ordinates | Areas under Terms of Mid-ordinates | Areas under 
Series of Curve Curve Series of Curve Curve 
1-000 1°911 2°124 37°658 37°766 38°098 
7°500 8°840 9°324 17°899 18°558 18-921 
26°380 27-000 27°538 7°069 7°800 8°043 
57°932 57°173 57°201 2°337 2°854 2-976 
89-219 87°608 86°861 *650 “923 “974 
102°602 100°864 99-994 °153 *268 °285 
91°609 90°179 89°536 “030 “O71 76 
65°188 64°446 64°423 “005 *016 018 
-_ ! YX Histogranz of Series F'(30,-50,100,-5) 
! \ 
' _2,, 421,928 3R 
. \ Gurve: y= ye 3” @2mer SFist 
! 2, 1379) 224 
[54 (R + ——) 9 
fe / . 9 
= 1 
S 1 
2 ; \ R=z+ 38 
a i rs 
Fe 5°h 10 
: eN 
S 
ve \ 
A ' 
254 " s 
/ , 
| 
' 
i 
0 fT I : Tt 
re) 1 2 3 4 5 6 v4 8 9 10 ii 12 13 
———z . 
Fig. 2. 
II. Series F(— 30, 60, —81, 4); c=1. 


The equation of the curve which fits this series is 
y = yee? (z + 7388,310)* 5.8 (61-888,310 — 2) 0572 


which belongs 


to type G,. 


Z(mode) = 7°036,596. 











Terms of Mid-ordinates 
Series of Curve 
1°000 1°993 
7°407 11°184 
27°299 29-002 
66°655 67°169 
121°133 120°184 
174°515 172°667 
207°300 205°218 
208*485 206°550 
180°898 179°374 
137°303 136°452 





Areas under 
Curve 


2°318 
11°512 
29-901 
| 67°817 
| 120°174 
172°412 
203-876 
205°333 
178-696 
136°405 











Terms of | Mid-ordinates | Areas under 
Series of Curve | Curve 
92°107 92°054 92°405 
55-036 | 55°637 56°115 
29°462 | 30°381 30°803 
14°189 15°094 15°388 j 
67165 6°863 7040 
2°421 2°870 2°963 
*860 17108 1°152 
“276 "397 “416 
“080 131 “138 
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Scale of Terms of F 


Scale of Ternes of F 


oul its Bie: Histofram of Series F (-30,60,-81,3) 


175 b Curve: y=y oe (z+ 1-388,310) "> 599,628 
X (61:888,310-z) 95909972 



























































150} 
125 | 
a 
100} 
2 
15 
50} 
25 J i Pie 
0 T — 1. s| 
o 1 2 3 4 5 6 yi 8 9 10 ti 12 13 14 15 16 17 
—_——> Z . 
Fig. 3. 
IV. Series F (— 20, 20, — 200, 5); c=1. 
Terms of | Mid-ordinates| Areas under Terms of Mid-ordinates| Areas under 
Series of Curve Curve Series of Curve Curve 
1°00 3°14 3°53 1609°69 1595°46 1589°87 
10°00 16°185 17°48 1155°28 1148°56 1149-18 
50°13 61°59 64°67 710°59 713°07 717°61 
167°09 181°39 186°50 372°16 381°60 386°96 
414°53 424°38 430°63 164°19 175°69 179°89 
812°15 812°09 816°12 60°03 + 69°39 71°88 
1301°52 1291°04 1289°33 17°74 23°38 oF 24°56 
1744°30 1727°35 1718°82 4°08 6°68 Ti 
1982 °67 1963°62 1951°16 “69 1°61 1°76 
1927°60 1909-41 1898°31 
| 























The curve which fits this series is 
y= yoe** (2 + T°747 944)72-282877 (42-914,611 — 2)8%272,679, 
(mode) = 8°816,981. 
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: X (42-914,611-z) 99°2 72,679 
1400} ‘ 

' 

I 
1200} ' 

! 

wl 
1000 }} 9 
:, 

800} ’ 

' 

! 
600 H ' 

! 
400 ; 

' 
200 H it ' 

' 

' 

o T T . T 
0 4 2 4 =: 7 i | a | er |r 7 ey 
—>2z 














Fig. 4. 

















O. L. Davies 71 


V. Series F (1, — 60, — 65, °9); c=1. 
Equation of the curve which fits the above series is 

Y = Yo €~ 78% (63-631,579 — z)*960, 
This curve belongs to type Gx (see p. 64). 






































| Terms of | Mid-ordinates | Areas under Terms of Mid-ordinates | Areas under 
Series of Curve Curve Series of Curve Curve 
| 1:°000,000 “993,952 *995,309 *003,534 *003,586 °003,595 
| *830,769 *826,180 *827,323 002,762 -002,806 -002,813 
| 689,279 *685,840 “686,802 “002,150 *002,187 *002,193 
| °S71,117 *568,580 *569,389 “001,666 -001,697 -001,702 
| 472,553 “470,719 “471,397 “001,285 “001,311 “001,314 
*390,437 *389,146 “389,715 *000,987 -001,008 “001,011 
| 322,111 "321,235 *321,712 000,753 -000,771 -000,773 
-265,332 -264,769 -265,169 “000,572 “000,586 "000,589 
| °218,213 °217,884 *218,219 *000,432 -000,444 Pecos hn 
| 179,164 *179,008 °179,287 *000,324 -000,333 “000, 
| -146,850 146,819 147.051 000,241 -000,249 000,250 
"120,150 -120,205 *120,398 “000,178 "000,185 <a 
*098,123 -098,234 *098,395 000,131 “000,134 “000, 1: 
‘079,979 “080,126 “080,260 “000,095 -000,098 “000,098 
“065,060 ‘065,227 “065,338 “000,068 “000,071 “000,071 
-052°814 | 052,989 053,081 “900,050 000.051 *000,051 
"042,779 042,954 "043,030 “000,034 “000,037 “000,037 
*034,572 -034,742 *034,805 “000,024 *000,025 “000,025 
027,874 | 098,034 | -0298.086 “000,016 “000,018 -000,018 
022,418 | -022,566 "022,608 “000,011 -000,012 “000,012 
017,983 | 018118 | -018,153 “000,007 -000,008 |. -000,008 
"014,386 | -014,507 | = 014,536 "000,005 “000,006 -000,006 
“011,476 | ‘011,584 | -011,608 “000,003 -000,003 “000,003 
009,128 | 009,222 “009,241 ~000,002 000,002. | poe ny 
°007,237 | °007,319 *007,335 000,001 | -000,001 -000,001 | 
| 005,719 |  -005,790 “005,802 “000,000 “000,000 “000,000 | 
| 004,504 "004,565 “004,575 
VI. Series F (— 100, — 100, 1, 3); c=1. 
The curve which fits the series is 
ss 808 
2R 808 tan SN 2R aie oe 
aa ~ +. 972 te tan ~ S65 / (Re F 20402 . 
aie 9 
where R=z—101/3. Znoae) = 41°835,570. 
Significant | Corresponding Significant | Corresponding : 
Terms of Mid-ordinates — egies Terms of Mid-ordinates a rs 
Series of Curve ——- Series of Curve 10-5 
x 10-% x 10-% = x 10-8 x 10-8 
“000,002 ~000,004 “000,004 “165,172 “164,379 “163,910 
“000,007 *000,014 “000,015 138,596 *137,167 “137,060 
-000,027 000,046 | 000,048 “107,318 -107,009 *106,997 
“000,093 | 000,137 “000,142 *076,710 *076,730 -076,956 
*000,287 *000,379 *000,393 *050,631 °051,513 *051,719 
“000,803 -000,977 -001,008 -030,864 031,366 “031,656 
“002,048 “002,341 -002,401 017,380 017,935 *018,145 
“004,762 -005,192 “005,300 ‘009,041 “009,542 “009,690 
“010,110 *010,658 *010,828 *004,345 004,731 *004,825 
“019,630 “020,215 020,451 “001,929 -002,190 “002,243 
“034,900 “035,391 “035,669 “000,791 -000,949 “000,976 
*056,888 *057,142 057,404 *000,300 *000,386 *000,399 
-085,104 *085,212 “085,183 -000,105 “000,147 000,153 
“116,959 ‘116,539 *116,503 -000,034 -000,053 “000,055 
*147,794 *147,110 *146,830 *000,010 *000,018 *000,019 
“171,857 “171,012 “170,664 -000,003 -000,006 -000,006 
*184,023 *183,113 *182,497 *000,001 -000,002 *000,002 
‘181,571 *180,676 -180,092 
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The condition z <1 must be satisfied by all convergent infinite hypergeometric 
series; but for finite series (i.c., when @ or 8 or both are negative integers) « may 
have any value whatsoever. A finite series with 2>1 may be reversed into one in 
which the fourth element is less than unity. Thus, if the series be 





—1 1 
F(-¢’, B,y,2)=1- $2 += ey Os ) a2 USWA 


where @’ is a positive integer, the last term is 
iy BB+1)...(B+@-1),, 
v(y+1)...(yta’=1) ” ~* 
Reversing the series, we have the equivalent one 


| a ytea’—11 a’ (a’—-1)(y+a’—-1)(y+a’—2) 1 | | 
X|"" Tip+e’—-1z 21(B+a'—1)(B+a—-2) 2 | 








1.e., 
F (-a’, B, y, x) 


=(-¥ SA +1) -- (8 +a’ —1) 
y(y+1)...(y+@ —1) 





1 
Pia, -(7+@ -1),-@+e'-0), }, 
Le. F (a, B, y, x) = const. x F(a, -yt+a+1,-—B+e+l, =), 


where @ is a negative integer. We notice that e=y—a— 8-1 is the same for 
both series. Replacing z of the fundamental differential equation ((2), p. 61) by 
— z—(a@—1)€c results in the corresponding equation for the series 


1 
F(a,-y+a+1,-8+a+1,-). 


Hence, nothing is gained by considering separately finite series in which « is greater 
than unity. All results which apply for finite series with «<1 apply also, with a 
slight modification, to finite series with # >1. 

In the above examples the curve found by equating - “2 to Ay,/$c¢ (Yr4a + Yr) 


of the series, where y, is the rth term, fits the auidhenns part of the series 
tolerably well. There is, however, a slight deficiency about the mode and excess 
towards the tails. The goodness of fit of this system of curves improves rapidly 
when the number of significant terms of the series increases. 


§2. An Alternative Method for the Fitting of Generalised 
Pearson Curves to F(a, B, y, 1). 


Consider firstly the series 


a8 , a(a+1)A(8 +1) 
l!y : 2!y(y¥+1) Cas, weed eneeebeec0seeeeee (i), 


in which @ is unity. When its standard deviation is not small, a Pearson curve P (z) 


1+ 
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may be fitted to it with sufficient accuracy for most statistical purposes*. Let, then, 
P,(z) be the Pearson curve having the same first four moments as (i). Let 2 be 
the value of z which corresponds to the mid-ordinate of the first block of the histo- 
gram of series (i). The abscissa of the mid-vertical of the rth block is, therefore, 
P (a +1), the grouping unit c being taken as unity. P,(z+7) also gives quite 
closely the numerical value of the rth term of the series (i). Hence, the expression 
a’ Py(2, +17) gives quite closely the numerical value of the rth term of the 
general series 

a8 a(@+1)R(8+1) 

my Dy (y +1) ip Se 


Accordingly, the generalised Pearson curve 


a1 P, (z) = const. 2? Pi (2) 





will give quite closely the ordinates of the general series. 


We know that the areas under the Pearson curve P (z) accord more closely to 
the terms of the series (i) than the ordinates *, but owing to the presence of the 
factor «?, or what is equivalent, e* where p = log, #, this property no longer holds 
for the generalised Pearson curve. We can, however, apply a small correction so 
that the areas under e P,(z) will accord to the terms of the series (ii) at least as 
closely as do the areas under P;(z) accord to the terms of (i). This correction is 
obtained as follows : 

Let Y, denote the ordinate of a P,(z) which corresponds to the mid-point of 
the (r+ 1)st block of the histogram of the general series, and let y, be the corre- 
sponding ordinate of P,(z). Then Y,=a7y,. 


























+b---------------— 


ee 
~ 


i ee 


1 “+1 


Let NV, denote the (r+ 1)st term of the series (ii) and n, the (r+ 1)st term 
of (i). Then, V,= 2"n,. 


When the standard deviation of the series is large, the following formula 
expresses with sufficient accuracy, n, in terms of y,: 


Mp = gy (ZQHp + Yat Yrpa) -oreceeeecerseececeeeseeees (iii). 


* See O. L. Davies, Biometrika, Vol. xxv. pp. 295—322. 
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The area under the generalised Pearson curve corresponding to the (r+ 1)st 
term of the series (ii) is 


aha ee eg ee eee ene eee eee (iv). 
Now N, =a7n,= 24 ©” (22y, + Yat Yrs) 
= 3a (22 ig + i z+ Y-41/x) oe ececcccccceecccccccceces (v). 


Hence, the correction to be applied to the (r + 1)st sectional area (iv) is 


c= -#([Yu(1 — «)-+ Y,41(1—- 1/2)] 


ss — "te Be > Ao, Renee erie eee (vi). 


If, therefore, Na, oe oS whole area of the curve P,(z)«? up to and 
t+1 
including the (¢ + 1)st term (=| if zde), the correction for Na, is 
0 


l-« 


“W4er 3 ( Y,.4- xY,_1) 


EG f- 


aL =| 2 Y,(1—2) + (Yu +¥,) —(Yo +2¥.)| an (vib). 


If there is high contact at the start of the curve, y_, and yp are very nearly zero 
and the correction simplifies to 


| 2, Y, e(l—a)+(a#¥i+ Yi.) 


l-«@ 
24a 
t 
Now > Y,=Na,+ E£, where E is a small correction found by applying the 
r=0 
Euler-Maclaurin theorem. 


The correction to be applied to Na, so that it may give a closer approximation 
to the sum of the first (t+ 1) terms of the series (ii) is, therefore, 


C,= Cag) [(Noy + E) (1 — 2) + (Yet Yoss)] cceceseeeees (vii), 


E is of the second degree of smallness compared with C,, and when « is not 
small it may be neglected oli C, may then be written in the form 


as acme ie +(«Y, +5 — Cece cecesscesesccetes 
which may be calculated os readily. 
If there be high contact at both ends, the correction for the whole area of the 
(1 — a)? 
24a N. 


. A close approximation to the sum of the whole series 


ni +25 =", 


curve reduces to 


— 


is, therefore, 


24a 


where N is the total area of the curve Y =a? P;(z), provided « is not small. When « 
approaches unity, the correction tends to zero. 
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If the simple formula (iii) does not give sufficient accuracy, we may apply the 
higher quadrature formula 


nr, = argo [5178y, + 308 (Yr4a 8 Yr—-1) —-17 (Yr+2 + Yr-2) |. 
The correction to be applied to the (7 + 1)st sectional area is 


1 1+ 
aac [308 (Yea — —#Y¥,.)—17- *2(F,a—2Y,2)]. 


Accordingly, the correction to be applied to the area Na; is 


(1-2)? es 
“syne Ne | 308 - 17 2) 


+o ae e¥)—17(*2*) (Yon + Yeu) —2°(Vea + Yo] 


57602 


When there is high contact at the start of the curve, the last term approxi- 
mates to zero. Further, when there is high contact at both ends of the curve, the 
correction for the whole area is 


- Frebe | 308 (Ye +0¥.)+17(“** =) (r i+ Vso +(Ya+ Yo). 





—aP% 
ed ca 


and accordingly, an approximation to the sum of the general series (11) is 
(1- ‘ a el + a)? 
" then 2 me a 
N E + at 308 — 17 *— : 


where WN is the total area of the curve 2? P;(z). 


wi “ 
i 2 


> 


The following results will give an indication of the magnitude of the correction C 
for the whole area when there is high contact at both ends. 


n=, C= (4889)"/, =2°/., 
a= 75, C= 34°, 
a =°9, C= -046°/.. 


In the following examples, the areas under the curves obtained by both methods 
are compared with the terms of the series. 
The curve obtained by applying the first method, i.e. from the equation 
ldy _ Ay, 
ydz c 
: 9 Yet Yrn) 
(§ 1), will be referred to as curve A, while the one obtained by applying the above 
method will be referred to as curve B. 
Examples. 
I. Series F'(30, 30, 111, °75); c=1. 
Equation of curve A is 
y = yor ** (2 + £322,090) 4 (2 + 83:392,196)-H 
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Equation of curve B is 
Y = Yo e~ 878% (2 + 4°408,767 4848 (z + 90°876,152)- 07127, 
The origin of both curves is at the start of the histogram of the series. 



































Terms of Areas Corrected Terms of Areas Corrected 
Series under A Areas under B Series under A Areas under B 
1°000 1°866 2°143 43°152 43°638 43°174 
6°081 7°539 7°592 31°030 31°496 31°036 
19°567 21°166 21°147 21°877 22-299 21°875 
44°328 45°503 45°344 15°152 15°517 15°146 
79°397 79°826 79°539 10°327 10°632 10°321 
119°717 119°421 119°114 6°937 7°183 6°932 
158°031 157°252 157°067 4°599 4°937 4°596 
187°553 186°573 186°615 3°013 3°160 3°010 
203°994 203°042 203°317 1°952 2°062 1°951 
206°280 205°490 205 °922 1°252 1°334 1°252 
196°095 195°535 196-001 “796 "896 “796 
176°795 176484 176°865 “502 “544 *502 
152°250 152°178 152°392 “314 *343 *314 
125°971 126°107 126°115 *195 *215 "195 
100°628 100°928 100°745 *121 *134 °121 
77°926 78°342 78°005 eet na ae 
58°705 59°176 58°752 
Il. Series F(— 30, — 50, 100, 4); c=1. 
Equation of curve A is 
2172 121928 3 _ 58 
_ a: 58\2 1379) ~ 27, 97879) “” (1379) (+ ) 
y=yoe | (2+) +e 1 e 


Equation of curve B is 
y= Yo & “693,147 2 (2 + 6°051,499 )27°787,858 (36°453,380 =e 52000 638. 
origin of both curves being at the start of the histogram. 



































Ss ees pele sade ——$—$—$_—— 
Terms of Areas Corrected Terms of Areas Corrected 
Series under A Areas under B Series under A | Areas under B 
Se Nhs 2 a _ het Mens Pe NERS Ne eee 
1°000 2°124 1°506 17°899 18°921 17°909 
7°500 9°324 8°157 7°069 8°043 7°072 
26°380 27°538 26°523 2°337 2°976 2°341 
57°932 57°201 57°502 *650 "974 *653 
89°219 86°861 88°893 *153 "285 "154 
102°602 99°994 102°742 “030 “O76 031 
91°609 89°536 91°946 “005 “O18 “005 
65°188 64°423 65°408 
37°658 38°098 37°728 
III. Series F'(—30, 60, —81, 4); c=1. 


Equation of curve A is 
Y = yoo? (z + 7°388,310) 88 (61:888,310 — z)300.872, 

















Equation of curve B is 


O. L. Davigs 


y= yor (z 2s 7°531,867 )?7 69.947 (37°648,329 im 2)2#°085,187, 


origin of both curves being at the start of the histogram. 





























Terms of Areas Corrected Terms of Areas Corrected 
Series under A Areas under B Series under A Areas under B 
1-000 2°318 1°937 92-107 92°405 92°144 
7°407 11°512 10°646 55°036 56°115 55°074 
27-299 29°901 30°477 29°462 30°803 29°478 
66°655 67°817 70°564 14°189 15°388 14°191 
121°133 120°174 123°146 6°165 7°040 6°164 
174°515 172°412 174-767 2°421 2-963 2°420 
207°300 203°876 206°642 "860 1°152 “859 
208°485 205°333 207°814 “276 “416 “276 
180°898 178°696 180°567 “080 “138 “080 
137°303 136°405 137°235 nee | ae 
IV. Series F (1, — 60, — 65, ‘9); c=1. 


Equation of curve A is 


Equation of curve B is 


y= Yq e105, 2888z (63°631,579 a g)# 286,150, 


Y = yo’ e~25:8805: (63-3.49,729 — 2)# 98.824, 
origin of both curves being at the start of the histogram of the series. 





| 


| 





Terms of Areas Corrected Terms of | Areas Corrected 
Series under A Areas under B Series under A Areas under B 
1+000,000 “995,309 “999,980 “003,534 -003,595 “003,534 
|  °8380,769 *827,323 *830,630 002,762 | °002,813 *002,762 
689,279 "686,802 "689,193 002,150 | 002,193 "002,150 
“571,117 *579,389 *571,058 *001,666 “001,702 *001,666 
*472,553 °471,397 *472,519 *001,285 “001,314 -001,285 
*390,437 *389,715 *390,421 *000,987 °001,011 “000,987 
*322,111 *321,712 *322,105 *000,753 *000,773 000,753 
*265,332 *265,169 *265,334 *000,572 “000,589 *000,572 
*218,213 *218,219 *218,221 000,432 “000,445 *000,432 
"179,164 *179,287 “179,174 *000,324 “000,334 “000,324 
146,850 *147,051 146,863 “000,241 *000,250 *000,241 
*120,150 *120,398 “120,162 *000,178 “000,186 -000,178 
098,123 “098,395 “098,136 *000,131 “000,135 *000,131 
079,979 -080,260 079,990 “000,095 -000,098 -000,095 
“065,060 ‘065,338 “065,071 “000,068 -000,071 “000,068 
052,814 -053,081 -052,823 -000,049 “000,051 “000,049 
*042,779 “043,030 “042,786 000,034 “000,037 000,034 
*034,572 *034,805 *034,579 *000,024 *000,025 *000,024 
"027,874 “028,086 *027,879 -000,016 *000,018 *000,016 
-022,418 “022,608 “022,422 “000,011 “000,012 “000,011 
017,983 018,153 ‘017,987 -000,007 -000,008 “000,007 
014,386 014,536 014,389 “000,005 “000,006 -000,005 
011,476 “011,608 011,479 *000,003 “000,003 *000,003 
“009,128 “009,241 *009,129 *000,002 *000,002 *000,002 
*007,237 “007,335 ‘007,238 *000,001 “000,001 *000,001 
*005,719 *005,802 “005,720 *000,000 “000,000 *000,000 
“004,504 *004,575 “004,504 see ““ wi 
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There is very little difference between the fit of the two curves at points up to 
the mode, but after the mode and especially towards the tail, the curve B is a 
distinct improvement on curve A. In the last example, where the curve is J-shaped, 
the former method gives far better results than that developed in § 1. An added 
advantage is that the fairly troublesome type G,y does not arise. It must be 
remembered, however, that this method is applicable only when the series with « 
replaced by unity is convergent. 


§ 3. Evaluation of the Incomplete Integral ie e°? P (z) dz. 


In the previous paragraphs we have shown that when the number of significant 
terms is fairly large, a general hypergeometric series F(a, 8, y, x) may be fitted 
quite closely by a curve of the form 


ST PERE wininsn sth Cileawideasssasblpsbas (1), 


where P(z) is a Pearson type curve. Hence, a number of terms of this series can 
be represented by an integral of y taken between the proper limits. When this 
curve is not very skew, we may often evaluate the integral by using the method 
outlined by Pearson in Biometrika, vol. 1. p. 390 et seg. This consists in expanding 
(1) about its mode into a series of the form 





3 (Sa) 2 
y =Yme 2m (1 +bg 23/314 bgzt/4! + ...), 


where u= log y, and z= ™m, the position of the mode of the curve y. The coefficients 
are expressed by Pearson in the form 


bs = a3, be = (dg + 10a3"), 
ba = MM, bz = (az + 35a3(14), 
bs = U5, bs = (dg + 56d3d5 + 35a4?), etc. 
d'u . 
where a;= (ca), (s= 1, 2, ...). 


These coefficients become increasingly more complicated. 


When finding the partial integrals of the curve, i.e., integrals from the start of 
the curve up to a point z, we may safely take the lower limit to be — 0. This 
enables us to express the partial areas of (1) in terms of incomplete normal moment 
functions. These have been tabulated up to the twelfth. (See Tables for Statisticians 
and Biometricians, Pts. 1 and 11.) 


If J, and /, are the limits to the range of the curve y; the complete integral is 
ly +00 
N= ydz= | ydz 
Jl, J—@ 


_ Ym (29) 1 by a4 be 
S V(- de) Sag" 48a? a 


This method will be referred to as Method A. 
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Example. 
y = yor *2 (40 — z)™. 
The total area of the curve found by quadrature is 79345°112y910". 
It is convenient to take yp = 10-™. 
du 30 60 
as ee 
The mode is, therefore, at the point z= 10. The constants 
dtu E 30. ~=—s«60 

y= (Fay @- D's HBB) 

are readily found. Substitute their values in (2) and we have 


N =79668'590 (1 — 017,14876 + °011,51807 
+ 004,69824 — -003,31286 + 000,90737 — ...) 


Now u=logy and 


= 79402°661. 

This is in excess of the true value by approximately ‘072 °/,, which is sufficiently 
close for most statistical purposes. 

Method A has been applied* with varying success to the evaluation of the 
incomplete Beta function. The results obtained there hold with a slight modification 
for integrals of the curve e~??P (z) because this curve and P (z) behave in very much 
the same manner. It has been shown that the curve obtained by Method A fits 
a Pearson type curve very well at points lying within + 1'5e of the mode, but may 
deviate appreciably at points further removed from the mode. This is also the case 
with the more general curves. 

In the next three paragraphs we shall develop other and more accurate 
methods of curve fitting. 


§ 4. The Fitting of Pearson Curves to Generalised Pearson Curves by Equating 
the Corresponding Logarithmic Differentials about the Mode. 
Method A may be generalised somewhat by expanding 
y= yor P(z) 
around the mode into a series of the form 
Y = You (2) {1+ byz + dg z2t ...} ccc ceeceeeeseneceeees (i), 
where u(z) is a Pearson curve of the same type as P(z). 


When P(z) is a Type I curve, y may be expressed in the form 


Y= Yo Pe (A — SYP nn erecrcrecsesceerensseveeees (ii). 
Let (ii) be represented around its mode “m” by the curve 
Y = Yo (ZF M)? (A— IM — ZT es ececserersererereeses (iii), 


where p und g are at our disposal. 


* Biometrika, Vol. v1. (1908), p. 68; Introduction to Incomplete T-Function Tables (1922), pp. xviii— 
xix; Tables for Statisticians, Part 1. (1931), pp. cexxxi--cexxxii; Tables of the Incomplete B-Function 
(1934), p. vi; Soper, Z'racts for Computers, vit. (1921), pp. 43—44; Wishart, Biometrika, Vol. xrx. 
(1927), p. 438, 
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Now Yy = yo’ e-?? (2 +m) (a—m—z) 
= ye" (m+ 2)? (a—m—2)V, 
where V =e-?? (ni + 2)? (a—m—z)4 
and m the distance of the mode of (ii) from the start of the curve. 
Let v=log V, 
then (dV/dz) = V (dv/dz), 
(2 V/dz*) = V (d?v/dz*) + V (dv/dz)’, etc. 
Choose p and q so that SO ag WHE 5. dvacavncansnkipuralivadatecyt a (iv a), 
CI is sa schetns etnies ots (iv b). 


The two curves (ii) and (iii) have the same range. Condition (iv a) implies that 
they also have the same mode and condition, (iv b) indicates that the two curves 
coincide to the second order around the mode. 

By Maclaurin’s theorem 

y = yo) (m + 2)? (a — m — 2) 


wey . TAR 1 (@o\ , 
" {1+ 5 (act) ,2* + 34 (Gat), + 130 (at) y? teal. 


The coefficients of the successive powers of z have the same form as those for 
the expansion obtained by Method A. Here, however, 


a; = (d' log V/dz*),,; V =e? (m+ 2)? (a—m—z)*~%. 
The partial integral y [ ez" (a—z)'dz may now be expressed in terms of 
“0 


the incomplete moments of Type I curves about the mode which may, in turn, be 
expressed in terms of incomplete Beta functions. These have been tabulated within 
a certain range, and, for functions lying outside this range, auxiliary tables have 
been constructed to aid in their evaluation. 


The ¢th moment of (m+ z)? (a -m — z)? about its mode is 


a~-m 
[ (m+ 2)? (a —m— 2 2'dz 
—m 


=(-) {' 2? (a—z)'(m—z)'dz 


2 a 
=(—f = (-)i'm" I’ 2Pti (a — z)t {t!/t!(t—1)!} dz 


7=0 


= (—)arrat E(pt DE @+)) | m — tm'-1 (p+I)a 


T'(p+q+2) (p+q+2) 
t(t-1) 4.  (p+1)(p+2) . 
. ie (p+q+2)(p+qt+3)_ 
ee (p+1)(p+2)...(p+tyat 
a (p+qt+2)(p+q4+3)...(p+q+t4l]) 


= (-)tarten CT (p+1)P(q+)) 


Oe aie’ t 


T(p+q+2) 
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e 
Hence the total area NV is equal to 


Cpt) @+)) 

- — m)-IqPptat 

Ymm-? (a — m)~2aPr4 l'(p+q+2) 
When evaluating the incomplete integral of y, M; has to be replaced by the 

corresponding series of incomplete Beta function ratios. 





[1 — bs Mg + bgM,— 65M; +...] (v). 


Examples. 
I. y = yoe* 2” (40 — 2); 
= yo2? (40—2z)9V; yo=10-™, 
where V =e77 2? (40 — z)™9, 


Choose those values of p and q which satisfy 


On (dhe Vids, o—- 14 eee 


10 30° 
30—p) 60— 
0=(d*log Vide},= —' a meine 8 


i.e. 


p=275, ¢q=825. 
The constants a, =(d‘ log V/dz*),, may be readily calculated from 
(30-—p) (60-q) 
att Ve 7) 
Tee [ TOF 30" 
t—1)!25 . 
aad (aa Vas 
Substitute these values in (v) and we have the following series for the total area: 
N =79210 (1 + 002,63380 — -001,27338 + -000,22961 
+ 000,06981 — -000,02788 + -000,04546 — ...) 





= 79342°867. 
This involves an error of °0028°/,. When we include seven terms in the 
expansion of y, the resulting curve fits y very well at points lying within the range 
+ 2°50 of the mode, but may de~iate somewhat at points lying outside this range. 
I]. y = yore 2 (150 — 2), yo=10%e* (mode at z= 50). 
We have = y=yo (2 + 50)?(100 — z)2(1 +. ag29/3! + aget/4!...), 
where a; is the ith logarithmic derivative of V at z=0, V being the expression 
e-# (z + 50)? (100 — 2)! 


The values of p and q found by equating to zero the first two derivatives of 
log V at z=0, are 


p=275/3, q=2p=550/3. 
The series giving the total area N is 
N=c(1 +'0005,39144 — 0002,44637 + -0000,12131 
+ 0000,44450 — 0000,02193 —...) 
=c¢(1°000,353282) and c=8409,952. 
Therefore N = 8'412923. 
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The area of the curve found by quadrature is 8412901. Hence N differs from 
the true value by less than ‘0003 °/,. Had we taken N =c, i.e. neglected all terms 
except the first in the expansion, the error would only have been 03°/,. This in 
itself is sufficiently accurate for most statistical purposes. If we include six terms 
in the expansion of y, the resulting curve fits y remarkably well at points within 
+ 3o of the mode. 


The series obtained by the application of Method A is 
N’ =c' (1—°004,33884 + 002,01703 ...). 


Its convergence is not nearly as rapid as the series obtained by the method 
of this section. 


Although the latter method gives far greater accuracy than Pearson’s, it has 
the serious drawback in being extremely laborious in application, especially when 
attempting to find partial areas. For this reason we will develop a method of 
fitting a Pearson type curve which depends on the equating of the first four 
logarithmic derivatives at the mode. It is hoped that the resulting fit will be 
sufficiently good for statistical purposes. An incomplete integral of the original 
curve would then, with the exception of certain cases, be expressible in terms of 
a single incomplete Beta function. 


§ 5. A systematic Fitting of Pearson Curves to certain Frequency Distributions. 


Let f(z) be a bell-shaped distribution and let a;, ag, ag, ... be its successive 
logarithmic differentials about its mode. 


Let y be a Pearson type curve referred to its mode; its differential equation 
is then 





1 dy _ d logy Pepe ee... (i) 
yas de inthetles®t “CUCU 1). 
Differentiate both sides of (i) successively with respect to z, we have 
a logy _ . _ 2 (b+ has) 
dz® bg thyz+bez® (bg + by 2 + bez?)?’ 
Plogy 2b, + Ghaz “ 22 (by + 2bez)* 
d® — (bgp + by z+ bez”)? (bg + b12 + bg2?)?’ 
d logy _ i 6by « 2 (3b? + 18), bez + 24d? 2” _ 62 (by + 2b,2)% 
dzA (bg + by z + bgz?)? (bo + byz + by 2?) (bo + 612 + bg2”)*’ 
etc. 


Equate now the corresponding logarithmic differentials of y(z) and f(z) at 
their respective modes. We have the following set of equations to solve for the 
unknown 0’s: 

1/bo = dg, 
— 2b,/be* = (3, 
6 (b;” = bo be)/bo® = (4. 
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Hence bo = 1/az, 
bh=- a3/2a,?, 
be = (3as" — 2ayae)/12a¢°, 

ldy Z 

yde age 1 

dg 2a? 12a* 





and ; 
(3a3" — 2agae) 2* 


The discriminant of the quadratic in the denominator is 
Hence the condition for a Type IV curve is (D < 0) 
9a" > 8a4a2. 
Condition for a Type V curve is (D = 0) 
9as? = Saga. 
Condition for a Type III curve is (bz = 0) 
3a32 = 2ayde. 
Condition for curves lying above the Type IV line is 
9as* < Baye. 
For a Type I curve the roots of the quadratic must be real and of opposite 


sign, i.e. 
Sas” < 8a,a ’ 
ae 


and 3a32 < 2a4d2. 
The first condition is here implied in the second. 


For a Type VI curve the roots of the quadratic must be real and of the same 
J q 
sign, i.e. 
9as" < 8a4d¢, 
3a” > Qayao. 
The curve is symmetrical when as is zero. The differential equation then 
reduces to 
ldy 6ag"z 
ydz 6ag—a42* 
dg is always negative for bell-shaped curves; hence, the following types arise 
according to the sign of aq: 
da4< 0 gives a Type II curve. 
a@>O0 , » » Vi, 


dga=0 ,, the normal curve. 


Note that the differential equation (i) can never represent a J-shaped curve, when 


J («) is bell-shaped. 
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The above method may be easily genera‘ised to the fitting of higher order 
Pearson curves, i.e. curves satisfying the differential equation 
Ldy | ‘ 
ydz bo t+ byz + bgz* + bgz? +...’ 
by equating the differentials of log y, at z=0, to the corresponding logarithmic 


differentials of f(z) at its mode. The resulting simultaneous equations may be 
solved successively for the unknown b’s in terms of the known a’s. 





Table to determine the Type of a Curve. 











Main Types Criterion Transition Types Criterion 
I 3a,” 24 2a42 Ill 3a3?= 2a, ag 
VI Bag > Yaz? > Gaya V 9a;2 = 8a, de 

IV 9a,? > 8a4 dq II a3;=0, ay<0 

— Normal a3=0, a,=0 

— — Vil a,=0, a,>0 

















Let u(z) be the Pearson type curve fitted by the above method to f(z). Write 
I (2)=u(2)V, where V = f(z)/u(z), both curves being referred to their respective 
modes. We may expand log V by Maclaurin’s theorem into the following series: 
log V = ao’ + ay'2z + dq’2*/2!..., 
where a;’ = (d'‘ log V/dz") at mode- 
The first four a’s vanish ; hence 


log V = as'25/5! + ag’28/6! +..., 


or V = exp. (a5'2°/5! + ag'2®/6! +...) 
a ds'2° dg 2° ay’? (ds* | 2a). 2” 
Therefore F (2) = w(z) [1 + ag’ 29/5! + ag’ 28/6! 4+...]. 
Examples. 
I. y = yoe* (2 + 10)" (30 — 2)” (origin at mode) ............... (i). 


The first four logarithmic differential coefficients of y at the mode are, 
respectively, 


aq=0; dg = — 11/30; 
dz = 5/90; a4 = — 166/9000. 
3a;* is less than 2a4d2, hence the curve to be fitted is of Type I. This curve is 
readily found to be 
y = yo (9°828,086 + 2) 219.48 (38°473,920 — z)iorssee ee. (ii), 


o =1°65. 
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yo is determined by equating the ordinates of (i) and (ii) at their respective 
modes. The total area of the curve (ii) is °793,2284, and that of (i), found by 
mechanical quadrature, is “793,4511 (yo is taken to be 10-% x e)—a difference 
of 028 °/,. 


The following table of ordinates is constructed to examine the goodness of fit 
of the two curves: 
































z Ordinates of (i) | Ordinates of (ii) Z Ordinates of (i) | Ordinates of (ii) 
=¥ “000,000 -000,000 2 “098,488 “098,493 
—6 *000,005 *000,005 3 “045,113 *045,127 
—5 *000,277 *000,273 4 *015,926 *015,946 
—4 *004,241 *004,227 5 *004,413 *004,429 
—3 "026,528 026,511 6 “000,972 “000,980 
=€ “084,595 “084,588 7 “000,171 -000,174 
-1 *158,609 *158,608 8 “000,024 °000,025 
0 "192,456 *192,456 9 *000,003 *000,003 
1 "161,593 “161,593 1¢ “000,000 -000,000 
I. y = yor * (2 + 50) (100 —2z)™, yo = 10-™ (origin at mode) ...... (i). 
The first four logarithmic differentials of y at the origin are, respectively, 
a= 0, dg = — 55/1000, 


dg= 13/104, — ag= — 105/108. 
Here again 3a;* is less than 2aya2, hence the curve having the same first four 
differentials as y at its mode is of Type I. Its equation is 
F = yo’ (48°73692 + 2)"™7.95 (1114-94062 — 2)*18300.81 
o +425. 
The total area of the latter curve is 841285, which agrees very well with the 
area (8°412,902) of the curve (i) found by quadrature. 





In the following table the ordinates of both curves (i) and (ii) are compared at 
intervals of 2 for the se ES B 




















z Ordinates of (i) | Ordinates of (ii) Z | Ordinates of (i) | Ordinates of (ii) | 

om ihe, 

“a | | | | 

— 22 “000,000 “000,000 4 | *514,603 *514,607 

—20 “000,001 | “000,001 6 “305,561 *305,569 
-18 *000,016 | “000,016 8 *149,209 "149,225 
-—16 “000,195 “000,194 i “060,260 *060,279 
—14 ‘001,607 | “001,603 12 | “020,221 “020,236 
—12 *009,265 “009,254 14 } “005,659 *005,668 

—10 038,581 | “038,564 eee *001,324 *001,328 

- 8 *119,034 119,018 as *000,260 | *000,261 
— 6 *277,991 *277,983 20 “000,043 | *000,043 
= 4 “500,464 “500,462 22 *000,006 *000,006 

- 2 “705,414 | "705,414 24 *000,001 “000,001 

0 *788,861 788,861 2 *000,000 *000,000 | 

2 ‘707,866 |  °707,869 | | 

| | 
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III. y =yoe (2 + 50) (24150) ooceecceccceeceeeeeees (i), 
Yo = 10™ (origin at mode). 


The first four logarithmic differentials of (i) at the mode are, respectively, 


oe ee 
a= @2=~ 9950" 
ag = 2020 _ 6840 
897x502) 4 ~ 97% 508 


The Pearson curve which has the same first four logarithmic differentials as 
(i) is of Type VI, namely, 


Y = yo (48°456,364 + z)**8!0 (387-406,732 + 2)-27544175 
ao =9-05. 
Area of (i) found by quadrature is 953324. Area of (ii) is 9°53364. 
The series for the total area, N, deduced by Method A is very uncertain; it is 
N = 9°470,962 (1 — 028,26991 — -031,87332 +...). 


























z Ordinates of (i) | Ordinates of (ii) z Ordinates of (i) | Ordinates of (ii) 
— 32 *000,001 “000,001 20 *059,869 059,913 
— 28 *000,051 “000,049 24 *028,607 *028,653 
~24 ‘001,115 “001,100 28 ‘012,617 “012,657 
— 20 “010,095 *010,053 32 *005,189 *005,217 
—16 *047,650 *047,598 36 *002,007 *002,024 
~12 137,084 137,056 40 “000,735 ‘000,745 
ae 267,859 *267,853 44 “000,257 100,262 
oa “384,302 “384,299 48 “000,086 “000,088 

0 428,955 "428,955 52 “000,028 “000,029 
4 "389,261 *389,260 56 *000,009 -000,009 
8 *297,170 *297,174 60 *000,003 -000,003 
12 *196,083 *196,097 64 *000,001 -000,001 
16 *114,276 *114,308 68 “000,000 “000,000 








The above examples show that the Pearson curve which has the same first four 
logarithmic differentials about the mode as the curve y= yoe’ P(z) (when not 
very skew) fits it with sufficient accuracy for most statistical purposes at points 
lying within + 3o of the mode. The range of agreement of the two curves is 
further extended when a few terms of the expansion is included; but these render 
the finding of partial areas an extremely laborious task. It is desirable, therefore, 
to develop accurate and convenient methods for fitting integrable curves to the 
“tail” of these frequency curves. 


§ 6. A method similar to the one developed in the beginning of §3 may 
generally be applied satisfactorily to determine the area of parts of the curve lying 
outside the range mode + 2°5e. Thus, let z >2z be the tail of the frequency curve 
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y = yoe** P (z) and let a; be the ith logarithmic differential of y at the “stump” 
z= 2%. Then, by Maclaurin, 


gag ert tatett 
; 2f?/2! 
= y (zo) eet PTY + agt8/314 ayh4/4!+...], 
where C=z—2%. 


The series in brackets is identical in form with the one deduced by 
Method A (§ 3). 


Evidently, 

iT - gata)? 

i = a 

y=y(zo)e"e sag [1 + a3 09/3! + agf4/4!4+...], 
1 | 
where ; bs “Fi a)’ 21 = 01/2. 
Hence, area of tail z > 2 is 
12° 1 


z o ——.(f+2,)? 
N', = y(20)e?” | e we ™ [L + asf8/3! 4 agl4/4!4...] db. 
70 


This may be expressed in terms of incomplete normal moment functions. Note 
that the first approximation is 


where 2’=~, $(1 —«a,,) the area of the tail of a normal curve, and 2z,, the 
Cc 


ordinate of a normal curve at the point 2’. This is the expression obtained by 
Burton H. Camp* for the determination of the tail of a frequency distribution. 
Only in rare cases is it necessary to go beyond the 6th term in the above 
expansion of V. 

Examples. 

I. Fit a curve to the tail z>4 of the curve 

(i) y = yor *(z + 10)” (30 — z)™, ya=e"l0-™, =o =165 


(origin at the mode). 


The first six logarithmic differentials of y at the point z= 5 are, respectively, 
a =— 14, dg = — *2293,33333, 
aj= 0100,97777, ay = — 0044,77155, 
as=  °0008,0069215, dg = — 0003,4554058 ; 


and the curve is 
a —1-4¢—--22933 r 
(ii) y = 0044,1287 e “ [1+ 0016,8296 ¢ 
— ‘0001,86548 £4 + -0000,06672 ¢5 
+ -0000,00936 f° —...], 


where f=2-—5. 


* Biometrika, Vol. xvi. p. 163. 














90 Asymptotic Formulae for the Hypergeometric Series 


The point = —1 is approximately 2°50 away from the mode. 


The following table compares the ordinates of (i) and (ii) at unit intervals of 
the argument z: 








z Ordinates of (i) Ordinates of (ii) 
4 °01592,63 *01592,64 

5 *00441,29 *00441,29 

6 *00097,18 *00097,18 

7 *00017,145 *00017,145 

8 *00002,433 “00002, 435 

9 *00000,278 *00000,279 
10 *00000,025 *00000,026 
11 *00000,002 *00000,002 

















Il. Fit a curve to the tail z> 10 of the generalised Pearson curve 
(i) y =yoe * (z+ 50)(100—z)™, yo= 102", o = 425 
(origin at the mode). 
The first five logarithmic differential coefficients of (i) at z= 10 are, respectively, 
4 =—'5, dg= — 733, 
adg= *00051,44033, a, = — 00006,00137, 
as= °00000,24768. 


The required curve is 
~ a $42 - $ 
(ii) y = 0602,6002e' E +00068,58106 (5) 


o\4 o\° 
— -00004,00091 (5) + .00000,06605 (5) + Sy . 

















where =z—10. 
Zz Ordinates of (i) Ordinates of (ii) 
10 *0602,600 *0602,600 
12 *0202,210 -0202,210 
14 0056,585 *0056,585 
16 *0013,241 -0013,240 
18 -0002,596 *0002,595 
20 -0000,427 -0000,426 
22 0000,059 | -0000,058 
24 -0000,007 *0000,007 
26 | *0000,001 | *0000,001 
| 


Il. Fit a curve to the tail of the generalised Pearson curve 
(i) = y =yoe * (2 + 50) (2 + 150)-%:2>20, yo= 10%, o =9-05 
(origin at mode). 
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The first six logarithmic differentials of (i) at z = 20 are 


a, = — 173,669,468, dg = — 005,741,120, 
a3= °000,204,74032, a4 = — 000,009 4929675, 
as= *‘000,000,55935840, ag = — *000,000,04045132. 


The required curve is 


a 
= q+ se 3 
(ii) y="059,869e ” E + 002,183,897 (3) 
o\* ce 
— 000,101,258 (7) + 000,004,773 (7) 


t 6 
+ 000,002,154 (3) ~s a ; 














where f=2z-20. 

z Ordinates of (i) Ordinates of (ii) 
20 *059,869 059,869 

24 “028,607 “028,607 

28 | 012,617 “012,617 
32—CCS| *005,189 “005,190 
36 *002,007 “002,011 
40 000,735 *000,741 

44 000,257 “000,262 

48 000,086 “000,089 

52 *000,028 000,030 

56 “000,008 | “000,008 











In the above examples, with the possible exception of the last, in which there 
happens to be an appreciable degree of skewness, the agreement between the two 
curves is very close indeed. We conclude, therefore, that when the curve 


G (z) = yoe? P(z) 


is not very skew, the tail z>£, where € does not lie within mode + 2°5¢, may be 
fitted very closely by the curve 


y = G(E) e284 —- 822 1] + ag (2 — £)8/3!+ ay (2 — E)/4!4+...], 


d' : : , 
where a; = 7 {log G(z)} at z= & Hence integrals of G (z) up to any point >utside 


d 


mode + 2°50 may be expressed in terms of incomplete normal moment functions. 
y 

The rapidity of convergence of the sequence o‘a;/t! (¢=3, 4, 5, ...) will give a good 

indication as to how many terms of the expansion y need be retained. 


It has already been shown (§ 5) that integrals of G(z) up to points lying within 
mode + 2°5¢ may be expressed very closely by a single incomplete Beta function. 

Finally, integrals of very skew and J-shaped curves are best evaluated by an 
avcurate quadrature formula such as Weddle’s or Euler-Maclaurin’s. 
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§7. Methods for Obtaining Approximations to the Sum of 
a Tail of a Hypergeometric Series. 


When the standard deviation of a hypergeometric series is sufficiently large, 
the above methods for finding approximations to the sum of a finite number of 
terms serve very well indeed for regions within + 2°5¢ of the mode but do not 
always give satisfactory results for more remote regions. In problems on probability 
the tail of a distribution is often of greater importance even than the body itself. 
For this reason it is desirable to develop more accurate methods for approximating 
to this tail. 


Let, then, the following series of constantly decreasing positive quantities, 
namely, 
do ++ dg+dgt..., 


represent the tail of a hypergeometric series F(a, B, y, «). 


Draw up the following table of the successive ratios of the terms a;: 



































Terms ist ratios 2nd ratios 3rd ratios 4th ratios 5th ratios 
A 
a 
r\=— 
a . a To 
1 ‘ Pi cg ‘ 
To= 2 i val _? 
re aay n= Pi a, —-% 
. a3 = V9 q _ Ps 1 . Us 
fae a 1= 
a “2 "4 2 et % 
3 = = 42 
a Ps rs q Ps . gJ2 
ae gs 
a a3 Me V5 Ps 
. as t 4 
f= 
. a 
a; 
Clearly 
aH : dg As . 
—=711, —=T171"9,...-, — =741%q...7; 
ao alo ao 
re="1pPi, Ts=TipPipe,---, Vs = 71 Prpe--- Psi, 


P2=7191 Ps=PiNde>--+> Ps= P1491 J2-++ Ys—1y etc. 
Hence 


Ao + 44+ dg+ Ag+... 
= d9(1 +71 +72 pi +713 pr? ps + ... Pepa po® ... Ps +--+) 
=do(L+ritrP pit rie pi ga + rip’ qi? ge +--+) 
=a (1+ rit+r2pi t+ rPpirga + ritpi® quits + -.--) 


s(s—1) s(s—1)(s—2) s8(s—1) (s —2)(s—3) 
=do > ry'pr l.2 qr 1-2-3 wy 1.2.3.4 
s=0 
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If the first ratios are sensibly constant, p1=q:=...=1 and the series reduces 
to the simple geometrical 
ay (1 ++ 2+ r+ oe 
We may, without fear of appreciable error, take the upper limit to be infinity. 
Hence a first approximation to the sum of the series is 


ado 


Si 





a 1- T1 ; 
If the second ratios are sensibly constant, i.e. g; =u =v, = ... = 1, to the figure 
of accuracy required in the result, the series reduces to 
2 2¢-2 
a > ryipr 1-2 =ag(Lt+ritri2ps + ripi? + ritpi® + ...). 
s=0 


The first three terms of the original series suffice to determine the values of the 
constants d@,7, and p,. Write r1:=7r, pi1=1—p. If the tail of the series does not 
lie too near the mode, p, is close to unity, i.e. p is a small quantity. Hence 

ro) 8 (s—1) re) 8 (s—1) 
ao = ps 1.2 = do & re(1 —p) 1.2 
= do[l+r+772(1—p)+73(1—3p+ 3p? —p*) 
+r*(1— 6p + 15p* — 20p* + 1ip* — 6p> + p®) 
+75 (1—10p + 45p? — 120p* + 210p* — 252p* + ...) 


=dao[l+r+r27+77+... 
— pr* (1 + 3r + Gr? + 107° + 15r4*+ ...) 
+ 3p?73 (1 + 5r + 15r* + 357° + ...) 
— p®r3 (1 + 20r + 120r? + 45577 + ...) 
+ L5ptrt (1 + 14r + 91r? +...) 


The above series may all be summed quite readily. Thus 
1 
l4r4774 7°4+...=: : 
l-r 
1 
+ 6r2 3 a 
1+ 3+ 6r7 + 10r* +... a-n 
1 
» 4. 157% + 3377 ei cise 
1 + 57 + 1577 + 357° 4 ... =a = 
1+ 20+ 12072 + 458024... = Prt 
(l-r)’ 


Hence a second approaimation to the sum of the tail is 


em a) ye + 2 os ar es 733 (1 + 13r +r*) 
= 7%, [1 p(y) +3 aoa ES 





(1-r)® 
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The first three terms are generally sufficient for computing purposes. The next 
approximation will involve an additional constant q.. 


Ss3= do (1+ r+ r?2pi t+ 7? pi3 qi + r*piegit + «..) 


= s(s—1) s(s—1) (s—2) 
=a = pr 1.2 n 1.2.3 
s=0 





Write qi = 1 +4. Generally q is very much smaller than p so that g? and higher 

powers may be safely neglected. 
Ss = ao [lL +r +72p, +13 p3(1 +9) + rtp (1 +q)*+ -..] 
Sao [1 +r 4+ r2p, +r? pi? + rp? +...) 
+ qr* pe (1 + 4rpi3 + 10r? py? + 207? pi” + ...))]. 

The sign = denotes “approximately equal to.” We will now obtain an 

approximation to the sum of the series _ 
S’ =1 + 4rp,° + 10r7 py? + 2073p: +... 


px is less than unity because the convergence of the tail of the hypergeometric 
series is more rapid than that of the geometrical ap (1+ r+7r?2+7°+...) whose 
first two terms agree with those of the tail of the hyperseries. Hence 


S’<14+4r4+10r+...=(i-r)* 
Moreover, 


S’< 1+ 4rpi3 + 107? pi + 2077p)? +... = (1 — rp8)*< (1 -— ry 
Hence (1 — rp,*)~* is a better upper limit to the sum S’ than (1—r)~*. There 


is, however, very little difference between these two expressions because py is close 
to unity. 


When p=(1 — py) is sufficiently small, say of the order ‘01, we may obtain a 
still closer approximation and, incidentally, a lower limit to 8’. 
S’ = 14 4rp2 + 10r? py’ + 2077p,” + ... 
=1 +4 4r(1— 3p) + 10r* (1 — 7p) + 20r3 (1 — 12p)+... 
= (1 + 4r + 10r? + 207? + ...) —rp (12 + 70r + 2407? +...) 
=[(1-—r)* — 2rp (6 — r)(1- rr). 
Evidently 
(1 — rp,*)* > S’ >[(1 — r)-*— 2rp (6 —r) (1—r)-*}. 
Hence a third approximation to the sum of the tail is 
Ss = do (1 — r)* [{1 — pr? 1 — r)-? + 3p?r8 (1 — r)4 
— rp3 (1 + 13r + 7?) (1 —r)-} 
+ qrpi (1 — r)-* {1 — 2pr (6 —r) (1 —r)*}], 
where p=l-p, q=qu-1. 
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Examples. 
I. Sum the tail of the series F (30, 30, 111, 75) starting with the 28th term. 
The tail of the series is 


°796,454 + °502,267 + -314,369 + .... 
We readily find 


71= °6306,2901, 


pi= “9925,014, p= 007,4986, 
~ qi= 1-000,64443, q="000,64443, 
w= °999,93951, 1— w= u=000,06049, 


ee rr. 


The true sum of the tail found by computing each term separately is 2°117,841. 
Furthe: , 


S; = 2°156,244, 

Sg = 2'156,244 (1 — -021,8576 + °002,27275 — -000,39944) 
= 2°113,153, 

Ss = 2°113,153 + 004,2444 
= 2°117,397. 


This differs from the true value by only 4 in the fifth place. 


II. Sum the tail of the series F (50, 50, 100, $) starting with the 81st term. 
The tail of the series is 
dp (1 + °578,123 + 333,418 + 191,836 + ...). 
Its true sum is 2°360,158ap. 
We readily find 
r= °578,12289, 
m= °997,581112, p =002,418888, 
gi = 1:0000,604352, q = 0000,604352, 
Sy = 2°370,3585a9, 
So = 2°370,3585a9 (1 — 004,54239 + -0000,7138 — -00000,429) 
= 2:359,7505d, 
Ss = (2°359,7505 + °000,33479) ao 
= 2:360,085ap. 
The last approximation differs from the true value by 7 in the sixth place. 
III. Sum the tail of the series F (50, 50, 100, 4) starting with the 61st term. 
The tail of the series is 


S = ao (1 + 617,161 + 879,270 + 232,118 + ...). 


True value of S is 2°585,789ap’. 
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We find the following values for the constants: 
r= ‘617,160890, 


pPi= °995,7540,680, p= 004245932, 
qi = 1:000,13065, q = 000,13065, 
and S, = 2°612,0623ay’, 
Se = 2°612,0623a, (1 — ‘011,03411 + 000,59183 — -0000,2058) 
= 2°584,7335ay’, 
S3 = (2°584,7335 + °001,1399) ao’ 
= 2'585,873a9’. 


S; differs from the true value by 1 in the fifth place. 


IV. Sum the tail of the series F (— 100, — 100, 1, $) starting with the 10th term 
after the mode. 
Modal term is the 42nd. We find 
r= ‘443,97189, 
Pi= °923,73046, p= 076,26954, 
1 = 1:000,156141, q = °000,156141. 
True sum of tail is ‘751,702 multiplied by a constant which will be dropped in 
the subsequent work. 
S, = ‘781,4122, 
Se = '781,4122 (1 — 048,626 + 015,9773 — :005,09085). 


p is too large to apply formula S, with any great degree of accuracy. It is 
interesting, however, to examine how closely the successive approximations to S2 
represent the true value of S. These approximations are 

(i) °781,412, (ii) -743,415, 
(iii) 755,900, (iv) °748,746. 

The formulae S is not entirely satisfactory when p is larger than ‘01. For such 
cases, we may obtain a very close approximation to S, by the application of the 
Euler-Maclaurin theorem to the summing of the series 

s(s—1) 


1.2 


The problem, however, will be approached from a wider angle, namely, by 
applying the Euler-Maclaurin theorem to the summing of the more general series 


e s(s—1) s(s—1)(s-2) $s (s —1) (s — 2) (s 3) 
> rp, 1.2 n ee uy 1.2.3.4 


s=0 


§8. Application of the Euler-Maclaurin Theorem to the summing of the Series 
a s(s-1) —s(s—1) (s - 2) 8(s —1) (s — 2) (8-3) 
B<m inn tg ' 8? 1.2.3.4 
s=0 
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Write log, m1= a, 
log, pi = b, 
log. qi =, ete., 


s-1) s(s—1)(s—2) 

. Nh lasts PO he i ; 

then S=d % pe. ee ns Se, SD ee Siestk Shaw (i). 
s=0 


The quantities a, b, c, ... generally tend rapidly to zero. Clearly, 
s(s—1) 





ote, ast bo ( s(s—1)(s—2) s(s—1)(s—2)(s—3) 
on te 28 [1+ fe tas 6 ** 1.2.3.4 bade 
1 { s(s—1)(s—2) s(s—1)(s — 2)(s—3) i = 
‘er tay iit. ee 
0 ee a “ =“ 
=a Le * TS ett +e + a8 ie enn (iii) 
s=0 
ae ee ee * 2 3 ; 
ig De FE ye Sb Cah” Fn.) 5 oc coos crs insnsepediensahciieupeuseae (iv), 
where Co’ =— 6/2, c’ =—(a— $b) 
1 
RS a SE a ae a 
= > ent e aga (+19) [1 +cos + s*+...], 
s=0 
where m=t'o, c= Vex)’ 


Hence, by the Euler-Maclaurin theorem, 


e 1 
o --5(« 
e 20 


NS = age a1" | 


+2,¢) ° , 
- {L+cow + a7 + ...} dat z,| 


0 


~ &) 


lr? Po ae ‘ ° 
=e" |e | eo l+fethe+..Jde+E,| saxetaseanen (ivb), 


where £, is the Euler-Maclaurin correction *. 
* We may also write 
S=a,et™" [« | rae 2 {1+i,2+8 27+... det E, | ‘ 
where Z,’ is approximately equal to 


-0387,153a, —*1578,127a, +*2411,459a, —*1220,458a,. (See § 10, p. 106.) 


More accurate expressions for FE,’ are the following: 


oF,’ sE, 

$a, +0°195,7755a, $a, +0°213,0246a, 
- 0°460,3838a, - 0°589,0198a, 
+0°546,5364a, +0-964,0148a, 

— 0°471,4289a, - 1-240,8904a, 

+ 0°260,6070a, +1:140,5644a, 

— 0°082,4736a, - 0'720,9438a, 
+0°011,3674a, 4-0°297,8547a, 

— 0°072,4970a, 

+ 0°007,8926a, 


These last two expressions are obtained from a form of the Euler-Maclaurin theorem due to Professor 
Karl Pearson, the former based on sixth differences and the latter on eighth differences. 


Biometrika xxv1 7 
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o 
The integral | e~ ** ged is the sth incomplete normal moment function and 


x 
has been tabulated up to the twelfth. S can, therefore, be expressed in terms of 
incomplete normal moment functions. 
As is frequently the case, the constants ¢, d, e,... are sufficiently small to be 
neglected altogether. Then 
SZage*™ (o | e7* da + Ey) Sriatitaaseneneneneeness (v). 
x 


This is, in effect, the expression obtained by Professor Burton H. Camp* for 
the sum of the tail of a discrete frequency distribution. He approached the problem 
differently, namely, by fitting the tail of a normal curve ‘to the tail of the distribu- 
tien. (v) has been put in the convenient form+ 


Sao {* ms on + v Ge Retetkek divkekc sevice ceastcen se (vi), 


% / 
8—a? 15 —-—10a,?+ 2,4 
“Get + B5a0t | 
This formula has been applied with a large measure of success to the summing 
of the tail of a binomial and hypergeometric series in which the last element ~ is 
unity. It fails, however, when the stump lies within 2¢ of the mode, and is only 
reliable for points outside mode + 3c. Similarly, when applied to the general 
hypergeometric series, the results are reliable only when the stump lies outside the 
range mode +3c. When the stump lies within mode + 3¢ it may be found 
necessary to include a few terms in the expansion (ivb). The formula (vi) is 
essentially equivalent to the approximation S_ obtained for the series 





where ¥=5+750[1+ 


When, as is very frequently the case, p = 1 — p; is not appreciably greater than ‘01, 
S holds several advantages over formula (vi); it may be applied more easily, the 
contribution of each successive term is made clear, and, since S2 is concerned only 
with the fitting of a series to another, no correction such as the Euler-Maclaurin 


need be applied. 


Retaining the constant c in (i), we find 


1 
S Zag S git’, alta {1 +08 e—" I 


s=0 1.2.3 
?.¢ 
Note that c= logeq = log. (1+ g)=9—-g t+ g--- 
s(s—1)(s— 2) s(s—1)(s— 2) 
Therefore {1 +08 C= {1 de Cre ae . 
8 (3-1) (s — 2) es ae 
Moreover, (l+q) 1.2.3 Pa Ppt mea 2) 


* Biometrika, Vol. xvi. p. 163. 
+ Tables for Statisticians and Biometricians, Part 1, Introduction, pp. xxx—xl. 
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Hence, a better approximation to S is 


1 
jr? @ Pa he {1 + SSI 


— 1.2.3 





s=0 


This differs from (vi) by the quantity 


2 0 2 2 2 
es avet™ < [e 33 (+219) _ s(s—1)(s— | r) am 
0 


1.2.3 


This is essentially the quantity added to S, (p. 94) to obtain the closer 

approximation $3. It is identical with the series 
agqr®p,? (1 + 4rp:* + 16r*p,’ + 2078 py” + ...). 

M accordingly satisfies the inequalities 

agqr*p? (1 — r)-* > aggr*p,? (1 — rp,*)* > M 

9 a — 
> aoqr® pi? (1 — r)* {1 - eared ccughhaquatiodt (vil). 
These inequalities give a good indication of the accuracy of the result obtained 


by the application of Camp’s formula (vi) and whether it is necessary to go a stage 
further in the approximation. 


co 
s 
- 


s= s 


[same expression ]. 
3 


M may also be expressed in the form 


2 ws —~lg? — 
aq js o e 3* {a (a —2)} {o (w@—ay)—1} [o (@— 2) —2} da + z| (viii). 
= i +a, 


o(s— E)e=3) 


If t. =< ets+bs(s— D2 | 7 . 


then ts = 7°p,°, 
s ll 5h 
tg = pi (5 +a+t+ 5) . 


9 


- 


c* = ry3 (1 +6a+ ) : 
neglecting squares and products of a and 6. 
Exp o 11+6a+15b 2+12a+41b 
id OS 
Since q is itself small, one term only of Z generally suffices. The evaluation 


of (viii) is often a little troublesome. However, in most cases the right-hand side 
of inequality (vii) gives sufficiently accurate results. 


Hence 


The most satisfactory way so far, then, of finding the sum of a tail of a hyper- 
geometric series when the “stump” does not lie within the mode + 2°5a is to. 


(a) obtain the approximate sum by the application of formula (vi), 
(b) correct this result, when necessary, by evaluating M. 
When p=(1—>p,) is not appreciably larger than ‘01, the right-hand side of 


inequality (vii) may be taken as a good approximation to M. In all cases, however, 
M is less than agqr*p,* (1 — rp,®)—*. 


7-2 
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Examples. 
I. (See above.) Sum the tail of F (30, 30, 111, °75) starting with the 28th term. 
The approximation obtained by using (vi) is 2°113,316. 


The correction (taking into account the value of q) obtained from the right- 
hand side of inequality (vii) is 004,2444. Hence, 


Corrected sum = 2°117,560 
True sum = 2°117,841) © 


II. Sum the tail of F (50, 50, 100, 4) starting with the 81st term. 
Formula (vi) gives 2°359,836, 


Correction = ‘000,335, 
Corrected sum = 2:°360,171 
True sum = 2°360,158) ° 


III. Sum the tail of F'(50, 50, 100, 4) starting with the 61st term. 


Formula (vi) gives 2°584,6630, 
Correction from (vii)= *001,1399, 
Corrected sum = 2°585,803) 

True sum = 2:585,7895 f 


IV. Sum the tail of F(—100, — 100, 1, 4) starting with the 52nd term. 


Formula (vi) gives *751,738 
True sum is *751,702) ° 


In this case it is not necessary to apply any correction, formula (vi) gives 
sufficient accuracy. The result cannot be trusted in the fifth place because y does 
not converge sufficiently rapidly for the approximate value (vib) to give five or 
more figure accuracy. Applying the left-hand side of inequality (vii) we readily 
find that the correction is less than ‘0000,3325, i.e. the result differs from the true 
value by less than 3°3 in the fifth place. The right-hand side of (vii) does not give 
a closer result because p = ‘07 is rather large. 


§9. On the Fitting of the Tail of a Type III Curve to the Tail 
of a Frequency Distribution. 


The above examples shew clearly that Camp’s method for the evaluation of the 
‘tail of a discrete frequency distribution by fitting to it the tail of a normal curve 
is not always adequate, especially in those cases in which the third ratios q; differ 
appreciably from unity. In general, the correction given by formula (vii) makes 
a considerable difference in the results; the accuracy being improved by one, two 
and sometimes three figures. 


The fitting of the tail of a normal curve to the tail of a frequency distribution 
F (z) depends on the evaluation of three constants. We can, alternatively, fit the 
tail of the Type III curve 


Fe He (SEP. ioc eeserccsvucsseece tes veestons (i). 
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This will depend on the evaluation of four independent constants and, accordingly, 
we should expect a certain amount of improvement in the results. 


The cases, F(z) continuous and F(z) discontinuous, will be treated separately. 
F(z) continuous. 
The problem will be approached by fitting (i) to the tail z>z of F(z) by 


equating the corresponding logarithmic differentials at the stump. 
Let u=log F(z) and a, - ©. F(z) at z=z. Equate these to the logarithmic 


differentials of (i) at z=0. There result the following equations, which have to be 
solved for the constants of the Type III curve: 


—ptt/a =a, 


— t/a? = ag, 
2t/a* = dg. 
Clearly, a = — 2ds/az, 


= — 4a,°/as", 
p = (2ae"/a3 — ay). 
Yo may be found by equating the ordinates at the respective stumps. 
Area of the tail z> z of F(z) 


- = F (20) | ee? (a+ 2)'dz 
a J0 


= — F(z,) e* [ estas 


where @, is the area of the tail z>a of the Type III curve y= yoe*2' and y, the 
ordinate at z=a. The quantity a, may be found from the Tables of Incomplete 
T- Function. 


F(z) discontinuous = (do, a1, dg, «..). 


I. Consider the discrete terms a;(¢=0, 1, 2,...) as being the ordinates of a 
continuous function f(z) at unit intervals of the argument z, the first term do 
being given by z=0. 

Let a, be the first term of the tail of F(z). We require the first three logarithmic 
differentials of f(z) at z=s. These may be found as follows. Calculate (to ten figures) 
the natural logarithms of the first six terms of the tail, calculate the differences 
and substitute in either of the following sets of formulae giving the differentials 
in terms of the differences. 

Forward Difference Formulae* : 

hfyo =(A—$A?2+4A8— fAt+ 1 AS— J ASH...) fo, 
fq!’ = (A? — AS + 15 A4 — 5 AS + 17 AS— ...) fo, 


h3fol” = (A? — 3.044 TAS— IEA +...) fo. 


* Whitaker and Robinson, Calculus of Observations. 
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Central Difference Formulae* : 

hfe! = 3 (fr— fa) — th (8% — 8/2) + ly (Oh — BF a) — shy (OH OF ad + os 
Refo” = 84, — Be Ot + Io OG — Bho Oy + +s 
hfe” =} (8%, — B2f_1) — § (8%, — BFA) + zhu (OF, — Bf) + ---- 
The constants of the Type III curve are then found as in the previous case. 
If S is the sum of the tail of F(z), then 


0 


oo 
S= | ee? (a+z)+ z| ' 
where £ is the Euler-Maclaurin correction+. Furthermore, 


Sa, & ew [° e-P 2t'dz+ ¥| SEE) Bree ent ed en (ii), 


where 


1 1 1 ‘ 22 ) 
V=5 + 12a « = pa) ~ 60a2 {t (t¢- 1)(¢ = 2) — 3t(t = l)ap + 3ta?p? — a%p*} 


+ as a0qa tt 1) (t-2)(¢-3) (t—4) —5t(t—1) (¢-2) (t—3) ap 


+ 10¢ (¢ — 1) (t — 2) ap? — 10¢(t — 1) a®p* + 5tatp* — a®p*} — 3 ; 
= | 
If a, is the area of the tail of the Type III curve y = ye? 2‘, z >a, and 2, the 
ordinate at z=a, S may be thrown into the convenient form 


=a, (J+ y). 


II. Alternatively, we may find the Type III curve which passes through the 
first four terms of the tail of F(z). Thus, let y= ye? (a +z) be the equation of 
a Type III curve, and let d;, dz, ds be the first three logarithmic forward differences 
of the first term, a,, of the tail of F(z). The logarithmic differences of y at z=0 
are, respectively, 

Ist —p+tlog(a+1)/a, 
2nd tlog(a+2)a/(a+1), 
3rd t log (a+ 3) (a +1)/(a+ 28a. 


Equate these to the corresponding differences of a,. This results in three 
equations to solve for the constants p, a and ¢. Eliminate ¢ from the last two 
equations and we have 
(a+3)(a+1)3 _ 
~ (a+2°a 


Le. dz log (a + 3) — (3d_ + ds) log (a + 2) 
+ (3d_ + 2ds) log (a + 1) — (dz + ds) log a =0 { = X (a)}. 


(a+2)a 


ds log (a+lyP ‘ 


ds log 


* Sheppard, Proc. Lond. Math. Soc. Vol. xxxt. p. 465. 


+ The correction E may, perhaps, be obtained more readily by using the formulae in the footnote, 
p. 97. 
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This equation may be solved quite readily by forming a small table of the values 
of the function X (a) and interpolating for the value of a which makes X zero. 


Having found a, the other constants may be determined by simple substitution: 
The sum S is then given by formula (ii) above. 

Examples *. 

I, Fit a Type III curve to the tail of F (50, 50, 100, 4) starting with the 81st term. 


The first six forward differences of the common logarithms of the first term of 
the tail are, respectively, 


A =— ‘237,9798,316, A? = — 00105,17821, 
A® = + -000,0263,063, A*t = — 00000,08886, 
A® = + -000,0000,383, A® = — 00000,00021. 


Converting into natural logarithms and substituting in the forward difference 
formulae, the first three logarithmic differentials reduce to 


D =— 546,737,414, 
D* = — 002,484,344, 
DF= -000,0638,021. 
The Type III curve which has the same differentials at z= 0, is 
¥ = Yoo ™™ (77-876, 5589 + 5) PO 50... 0ccceees (iii). 


Yo is found by equating y(at z= 0) to the first term of the tail of the series. 
Hence yo = (77°876,559) —15086, 946, 








Terms of tail Ordinates of (i) Terms of tail Ordinates of (i) 

1-000,000 1-000,000 “000,678 *000,678 
*578,123 *578,123 *000,381 000,381 
*333,418 *333,417 *000,214 *000,214 
*191,836 *191,836 *000,120 *000,120 
*110,121 *110,122 *000,067 *000,067 
°063,073 063,072 *000,038 *000,038 
“036,046 *036,046 *000,021 “000,021 
“020,556 “020,556 *000,012 *000,012 
*011,698 “011,698 -000,006 *000,006 
“006,644 006,644 *000,004 *000,004 
*003,766 -003,766 “000,002 *090,002 
*002,130 *002,130 000,001 *000,001 
-001,203 “001,203 




















The fit of the Type III curve to the tail of the series is surprisingly good, and the 
area of the tail, corrected by the Euler-Maclaurin theorem, agrees to six figures with 
the true sum of the tail. This is a decided improvement on any of the previous 
methods. 


* In the first two examples the actual terms of the series have been multiplied throughout by a 
constant to reduce the first term of the tail to unity. 
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Il. Fit a Type III curve to the tail of F(50, 50, 100, 4) starting with the 61st 
term. 


Using the forward difference formula, the first three logarithmic differentials 
are found to be 
D = — *480,453,0646, D? = — 004,390,8871, 
D? = -000,139,3941, 
and the required Type III curve is 








Y = Yq eM (2 + 62-999,6028) ra... esc ee ee (iv), 
where Yo = (62°999,6028) —17°427,2117, 
Terms of tail Ordinates of (i) Terms of tail Ordinates of (i) 

1-000,000 1*000,000 “000,824 “000,824 
*617,161 *617,161 “000,484 *000,484 
-379,270 “379,270 -000,284 -000,283 
*232,118 *232,117 “000,166 *000,165 
“141,492 “141,491 -000,096 *000,096 
“085,915 “085,914 “000,056 “090,056 
051,972 *051,971 *000,032 “000,031 
031,325 031,324 *000,019 -000,018 
“018,813 -018,812 “000,011 *000,010 
*011,260 *011,259 “000,006 | *000,006 
*006,117 | -006,116 *000,004 *000,004 
003,994 | 003,993 “000,002 -000,002 
“002,367 “002,366 “000,001 “000,001 
-001,399 “001,398 




















Here again the fit is surprisingly close. The corrected area of the tail of the 
Type III curve agrees to five figures with the true sum of the series. It is out by 
only 2 in the sixth place. 


Ill. Fita Type IIL curve to the tail of F(30, 30, 111, 2) starting with the 28th 
term. 

The first three forward differences of the common logarithm of the first term 
are, respectively, 

d, = — '200,2261,429, dz = — ‘003,2694,047, 
ds = ‘000,2809,485. 

The equation of the Type IIT curve which passes through the first four terms of 

the tail is readily found to be 
Pe hs cae hg SEF ethos 5 ee ee (v), 
where Yo = '796,454/y (at z= 0). 

The true sum of the tail of the series is 2°117,841 and the area of the tail of 
the Type III curve (i) corrected by the Euler-Maclaurin theorem is 2°117,662. The 
fit of the curve to the series is not so good as in the previous two examples, but 
still, it is an improvement on any of the previous methods. 
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Terms of tail Ordinates of (i) Terms of tail Ordinates of (i) 
*796,454 *796,454 -001,283 “001,272 
*502,267 *502,267 -000,762 “000,752 
*314,369 *314,369 -000,451 “000,444 
"195,414 *195,414 -000,267 “000,261 
*120,708 *120,706 *000,157 -000,153 
074,135 °074,127 -000,093 -000,090 
045,291 *045,279 -000,054 -000,052 
*027,537 027,520 -000,032 “00,030 
016,669 *016,649 -000,019 -000,018 
-010,049 *010,029 “000,011 “000,011 
*006,036 *006,017 -000,006 *000,006 
*003,614 003,597 -000,002 -000,002 

| 002,157 “002,142 “000,001 -000,001 





§10. It was shown in §1 that when the number of significant terms of the 


hypergeometric series F'(a, 8, y, «) is not small, the curve for which dy is equal to 


dz 
Ay,/{$e (Yr + Yr+a)}, 7 = 1, 2, 3, ..., fits the series quite, well around the mode but may 
deviate a little towards the extreme tails. The probability integral of the series 
may be replaced by that of a continuous curve of the type G(z)=y,e~* P(z) with 
sufficient accuracy for most statistical purposes. 

An examination of the tables in the latter part of §1 will reveal that the 
ordinates of the curve G(z) correspond more closely to. the terms of the series than 
the areas under the curve. If, therefore, the constant of integration, yo, is calculated 
by equating the maximum term of the series to the corresponding mid-ordinate of 
the curve, the mid-ordinates around the mode will agree even more closely with the 
terms of the series. The deviation towards the tails, however, will be mere 
pronounced. If, therefore, a more accurate determination of the terms around the 
mode is required, the following method may be used: 

Let G(z) be the curve obtained from the differential equation 1 dy ae : 
y dz c (Yr+Yr+a) 
the constant of integration being determined by equating the modal term of 
the series to the corresponding mid-ordinate of the curve (note that this is not 
necessarily the mode of G(z)). The approximate value of the sum of a number of 
terms near the mode is, therefore, 


s=[“G@adc+F, 


where 2, and zg are the appropriate limits and # the Euler-Maclaurin correction. 
A close approximation to # may be found as follows: 


If y, is the (7 + 1)st mid-ordinate of the curve, the area under this part of the 
curve is given approximately by 


1 <4 , 
5760 [5178y, + 308 (Yr+a + Yr—-1) —17 (Yrae+ Yr-2)]3 


the grouping unit ¢ is taken to be unity. 
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Hence, if we wish to sum the terms from the sth to the ¢th, 
t t 
| G(2)dz= &  y,—0887,153 (yore + ye) 
s r=s+l 


+ °1578,127 (Yss3 + Yea) — °2411,459 (ys42 + yr_a) + °1220,485 (541 + 2). 
Therefore, 


t t 
cen | G(2) dz + 0887,153 (ys4a-+ Yrs) 
1 s 


r=st 


— -1578,127 (43+ Ye_2) + 2411459 (yore + Yea) —°1220,485 (yous + yr) 


The terms to the right of the integral make up the correction. Ly, is 
approximately equal to the required sum of the terms of the series near the mode. 


When the fourth element « is not small, the method given in §2 for finding 
the partial sums of F(a, 8, y, 2) is quite satisfactory for all parts of the series except 
the start. The curve deduced by this method fits the series very closely at points 
bevond the mode but may deviate appreciably at the start. The range of disagreement 
extends rapidly towards the mode as « becomes small. 


The particular method to adopt for finding the sum of a number of terms, when 
large, of the hypergeometric series depends on the type of series and the position of 
the terms relative to the mode. It will be sufficient to consider sums of the type 
S;(=sum of the first ¢ terms) because the sum of any number of successive terms 
may be expressed as a difference of two S’s. 


1. When the last term of S;, (i.e. the tth) falls outside the range mode + 2°5c, one 
of the methods of §7—9 may be used for evaluating S,. The list of methods in 
order of accuracy is as follows: 


(i) Fit a geometrical series a, = r* to the tail and obtain the approximation 
s=0 
S; (p. 93). 
(iia) Fit the series ay = r* p,**-»? and obtain the approximation 8, (p. 93). 
s=0 
(1b) Use the equivalent method of p. 98 (Camp’s method). 


~ s(s—1) 8 (s —1) (s— 2) 
(iiia) Fit the series ag = r*p, 1-2 gq, 1-2-3 and obtain the approximation 
s=0 


Ss (p. 94). 
(111 b) Correct the result obtained by (iv b) as in p. 97. 





(iv) Fit a Type III curve to the tail of the series by either 
yP 7 
(a) equating the logarithmic differences at the stumps, or 


(b) equating the logarithmic differentials at the stumps. 


The required sum may then be expressed by an incomplete Gamma function 
(pp. 103—104). 
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(v) Any degree of accuracy may be obtained by taking a sufficient number of 
s(s—1) s(s—1)(s—2) s(s—1)(s—2)(s—3) 


ao 
3 ; 1.2 1.2.3 2.3.4 
constants in the series a = r* pi n Uy cs ... and ex- 
s=0 





pressing the sum of the tail in terms of incomplete normal moment functions. 


2. When the tth term lies near the mode, then for 
(i) « small (< 4), use method of §1, 


(ii) # >4, use method of § 2. 


3. For J-shaped series, use method developed in § 2 whatever the value of z. 


The partial areas of any bell-shaped continuous distribution, f(z), whose 
logarithmic differentials at any point are known, may be found by applying the 
methods of §§3—6. In addition to those given for finding the area of the tail, we 
may also use the method of §9 where the tail of a Type III curve is fitted to the 
tail of a continuous distribution. Clearly if w(z) be the Type III curve fitted by 
this method tc the tail z > 2 of f(z), then 


f(Q)=u(O exp. (aS 4/4! +5 9/5! + ...), 
where (= z—z and a; the ith logarithmic differential of f(z)/u(z). Moreover, 
F(Q)=U(E) [1 + ag F 4/414 as F9/5 14...) 
The area iy J (2) dz of the tail of f(z) may then be expressed in terms of in- 


z 
complete Gamma functions. 
plete Gamma functions 


In conclusion I wish to record my indebtedness to Professor Karl Pearson for 
suggesting this problem to me and for his helpful advice and criticism. I wish also 
to thank Miss Kirby for preparing my diagrams. 








DIE STATISTIK DER SELTENEN EREIGNISSE*. 
Von ROLF LUDERS. 


I. Term. THEORIE. 


§1. Hinleitung und Problemstellung. Werden n Individuen eines Kollektivs 
auf das Vorhandensein eines gewissen Merkmals gepriift und ist bei jedem 
Individuum die Wahrscheinlichkeit fiir das Vorhandensein dieses Merkmals q, 
so ist die Wahrscheinlichkeit dafiir, dass genau r Individuen das Merkmal besitzen, 
bekanntlich 


P,=(") qr a-g". 


Diese Formel, die hiufig Newtonsche Formel genannt wird, lasst sich aus dem 
binomischen Satz leicht herleiten F. 


Wird die Anzahl n der Individuen unendlich gross und konvergiert g gegen 
Null derart, dass n.q =m endlich bleibt, so erhalt man die sogenannte Poissonsche 
Formel 


Diese Formel stellt zugleich eine Niiherungsformel fiir den Fall dar, dass n sehr 
gross und q sehr klein ist. Die Grisse m bedeutet dabei den Mittelwert, d. h. die 
durchschnittliche Zahl der Individuen, die das Merkmal besitzen. 


Die Poissonsche Formei lisst sich in zahlreichen Fallen auf die Statistik der 
seltenen Ereignisse, wie der Unfalle, Krankheiten oder Selbstmorde anwenden. 
Bei Unfillen ist das Kollektiv die einem Unfall ausgesetzte Bevilkerung, das 
Merkmal ist das Eintreten eines Unfalls. Stellt man die Zahl der Unfalle etwa 
fiir eine Reihe von Monaten fest, so kann man die beobachtete Haufigkeit einer 
bestimmten Zahl von Unfallen im Monat mit der aus der Poissonschen Formel 
berechneten Hiufigkeit vergleichen. 


Die Poissonsche Formel gilt aber nur unter folgenden Voraussetzungen : 
(1) Die Wahrscheinlichkeit fiir den Eintritt des Ereignisses muss in allen 
beobachteten Monaten dieselbe sein, sie darf nicht von der Zeit abhiangen., 
(2) Die Wahrscheinlichkeit muss sehr klein sein, da unter dieser Voraus- 
setzung die Poissonsche Formel abgeleitet worden ist. 
* Dissertation zur Erlangung der Doktorwiirde der Mathematisch-Naturwissenschaftlichen Fakultat 
der Hamburgischen Universitit, 1933. 


+ Mises, Vorlesungen aus dem Gebiete der angewandten Mathematik, I. Bd., Leipzig-Wien, 1931, 
8. 128. 
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(3) Die einzelnen Ereignisse miissen unabhiangig von einander sein, durch 
das Eintreten eines Ereignisses darf die Wahrscheinlichkeit nicht geindert werden. 
Es darf also keine Wahrscheinlichkeitsansteckung bestehen. 


Lasst man die letzte Voraussetzung fallen, so kommt man zu der von Eggen- 
berger und Pélya aufgestellten Formel, die im folgenden stets Eggenbergersche 
Formel genannt wird*. Nach dieser Formel ist die Wahrscheinlichkeit dafiir, dass 
genau r Ereignisse eintreten, 


h 
P, =A h(h+ d) (h + 2d)...(h+(r—1)d).(1+d) a" (r=1, 2, 3,...); 
; h 
und es ist Pp=(1+d) 4. 


Dabei ist h der Mittelwert, die Grisse d ergibt sich aus der Beziehung 
h(i+d)=pe2,' wo peo= > (r—m/y P,, 
r=0 


das Mittel zweiter Ordnung (Quadrat der mittleren Abweichung oder Streuungs- 
quadrat) ist. 


Die Eggenbergersche Formel findet Verwendung in der Statistik der ansteck- 
enden Krankheiten. Ihre Ableitung aus dem Urnenschema erscheint aber, worauf 
in einem Artikel von H. Pollaczek-Geiringer+ hingewiesen wird, sehr gekiinstelt. 
In demselben Artikel gibt nun die Verfasserin eine Formel an, die einen weit 
allgemeineren Ansatz fiir die Theorie der seltenen ansteckenden Ereignisse bietet. 


Diese Formel, zur Abkiirzung als “Forme! I” bezeichnet, soll im folgenden von 
einem speziellen Standpunkt aus hergeleitet werden. Darauf sind einige Spezial- 
falle zu diskutieren, ausserdem ist der Zusammenhang mit der Eggenbergerschen 
Formel zu klaren. Ferner soll die Frage der Mittelwerte einer kurzen Untersuchung 
unterzogen werden. 


Die Ergebnisse der Theorie sollen im zweiten Teil an Hand einiger Beispiele 
nachgepriift werden. 


§ 2. Ableitung der Formel I. In der Statistik der seltenen Ereignisse ist 
die Poissonsche Formel hiufig deswegen nicht giiltig, weil nicht nur Ereignisse 
beobachtet werden, die einzeln und unabhiingig von einander eintreten, sondern 
auch Ereignisse, die zu je zweien oder dreien eintreten. Das gilt etwa bei Unfillen: 
Durch eine Ursache kann eine Person zu Schaden kommen, es kénnen aber auch 
zwei, drei oder noch mehr Todesfille durch eine Ursache eintreten. 

Es sei h, die mittlere Anzahl (der Mittelwert) der einzeln auftretenden 
Ereignisse, hg die mittlere Anzahl der Paare von Ereignissen, hg die mittlere 
Anzahl der Tripel von Ereignissen, allgemein A, die mittlere Anzahl der n-fachheit 
von Ereignissen. Dann ist die mittlere Anzahl der Ereignisse tiberhaupt 


m = hy + 2he+ 3hg +... +nhyt.... 


* Pélya und Eggenberger, Zeitschrift 7. angew. Math, u. Mech. 1. 279, 1923. 
+ Zeitschr. f. angew. Math. u. Mech. vu. 292, 1928. 
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Nun sei die fundamentale Annahme gemacht, dass die Wahrscheinlichkeiten 
fiir das Auftreten von 1, 2, 3 bzw. allgemein n zusammengehirigen Ereignissen 
durch die Poissonsche Formel bestimmt werden und von einander unabhingig 
sind. Dann ist die Wahrscheinlichkeit dafiir, dass genau v; Einzelereignisse 
vorkommen, 





entsprechend die Wahrscheinlichkeit fiir das Vorkommen von vg Paaren von 
Ereignissen 


—h v. 
Pao tla 
-_ V2 ! 


und ganz allgemein fiir das Vorkommen von y, n-fachheiten von Ereignissen 


P,, (my = Om hn’ 
i Vy! 
Wegen obiger Voraussetzung der Unabhiingigkeit der Verteilungen von einander 
ist die Wahrscheinlichkeit fiir das Auftreten von r Ereignissen iiberhaupt 


P,= > P,® P,P... Pr, 


Vj»t+QWot...+rvr=Pr 


v v 
P,, = e-bay, —-- s hy” wh 2... h, r 


Vy, +2Qvgt...trve=P V1! ve! ... vy! 
Aus dieser im folgenden als “Formel I” bezeichneten Formel ergibt sich speziell 


Po = e7sas—, 
> 2 ) ,) hy? 2 ”) his 
I 1=Po.hi, I »= Pol, + ha) Py=Po( 3, +hyhe + hs) ° 
§3. Der Mittelwert und die Mittel héherer Ordnung bei der Formel I. Zuniichst 


ist die erzeugende Funktion der Formel des vorigen Paragraphen zu bestimmen. 
Diese Funktion erhailt man aus 


S D S is hy’thg”2 eee 
S(@=iPeva2tecrers'sr- F&F —. 
r=0 r=0 Vj t2r,+...=7 Vii Ves oes 
= hyz)"1(hgz?)’2... 
= eh hy-.. 5 > (/2)" (Os. ) : 
. r=0 4, +27,+..=7 vy! vg!... 
da 2° = 2%1t2"st-- ist, 
; 2 (hyz)" 2 (haz?) 
Weiter folgt FS (2) =e = a St = (oe ) ats 


v,=0 4! v.=0 Vg! 
Die erzeugende Funktion der Formel I ist also 


Ff (2) Sea tata— , ghz tgatthigd+.. 
Zuniichst ist fQ)=1, dh. & P,=1, 
r=0 " 


eine Bedingung, die bekanntlich notwendig erfillt sein muss. 
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Um die Mittelwerte zu erhalten, hat man zu differenzieren. Setzt man zur 
Abkiirzung 
h(z)= —hy—he- hg — pak + hyz + hes? +hg2* + oe 


so erhalt man f(a=e®, f' (=k (2)e®, 
f()={N (AP+K" (AM, f(z) = {hl (e+ Bh’ (e)hl" (2) +h" (2)} eh. 
Nun ist h(l1)=0, h’(1)=hi + 2heg + 3hg + 4hgt..., 


hi" (1)=2. hg +3. 2hg+4.3hg+..., bh’ (1)=3.2. 1hg + 4.3. 2hat.... 
Man erhalt also zunichst (f’(1)=/y + 2he+3hg+.... 


Da nun der Mittelwert m= % rP,=f"'(1) 
r=0 
ist, so ergibt sich wie im vorigen Paragraphen 
m = hy + 2he+3hg+.... 


Ferner ist FS’ (1) =m? +2. 1he +3. 2hg + 4. Bhat... 
Andererseits gilt fiir das Mittel zweiter Ordnung 


oe) « 
po= =X (r—myP,= 2 r(r-1) P,+m—-m* 
r=0 r=0 
=f" (1) +m—m; 
also wird Me = hy + 2het 3hz + 4hat... 


+2.1he+3.2h3+4.3hq4+... 
=hy + 4hog+ 9hg + 16hg4+..., 
oder Me = hy + Whe + 37hg + 4hgt .... 
Endlich ist 
FA) = m8 + 8m (2. 1hg + 3. 2hgt+ 4. 3hg4+ ...)+3.2.1hg4+ 4.3. 2hg4.... 
Nun gilt fiir das Mittel dritter Ordnung 
bs= > (r—m) P, = Sr (r—1)(r—2) P,+3(1—m) Er (r —1) P+ m—3m?+ 2m® 


r=0 
=f'" (1)+3(1—m)f" (1) + m— 3m? + 2m; 
also wird 
fg =m? + 3mh” (1) +h’ (1) + 3m? + Bh" (1) — 8m> — 3mh” (1) + m — 3m? + 2mF 
=h'"’(1)+ 3h" (1) +m 
=hy + 2he+3h3+ 4aq4+... 
+3.2.1hkg+3.3.2hg+3.4.3ha4+... 
+3.2.1hg+4.3.2hat... 


d. h. aber bs = hy + She + 27hsg+ G4ha+ ... 
oder Bs = hy + Who + Shs + 48h Pissee 
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Fiir die Berechnung der Parameter stehen also zunichst die folgenden drei 
Gleichungen zur Verfiigung : 


m=hy+2he + 3hs + eeey 
fz =hy + 2he+ Bhgt ..., 
Ps =hy + Bhet+3%gt.... 


Die Mittel vierter und héherer Ordnung diirften fiir die praktische Verwendung 
kaum in Frage kommen, da sie mit zu hohen Fehlern behaftet sind. 


§ 4. Die Eggenbergersche Formel als Spezialfall der Formel I. Es ist jetzt zu 
zeigen, dass die Formel von Eggenberger sich als Spezialfall der Formel I ansehen 
lisst. Sollen die Verteilungen nach diesen Formeln iibereinstimmen, so muss die 
Gleichung 

h 
en am hg—hs— -- = el + d) 4, 
und ausserdem die Gleichung 


h 
s hy hg”: hg”s ose 


h(h+d)...(h+(r—1)d) 











—h,—he—hs— = ( ~d 
Jey: SO ee FF een: rid+ay ; 
fiir r= 1,2, 3, ... erfiillt sein. Es miissen demnach die folgenden Gleichungen 
gelten : 
hk 
hy = 7 d eee eee ee eee eee ee ee eee eee ee ee eee ee (1), 
hy h(h+d) - 
a1 tes asa) ee Rr ees 4 ear RA (2), 
h;® _ h(h+d) (h+ 2d) 
31 + Iyho+ hs =— 31 a +a Cocccccvceesevesccese (3), 
usw. 
Aus (2) erhailt man unter Benutzung von (1) 
h? ree h? hd 
21(i+d * 210 +a)? * 20 4a)?’ 
fe hd 
a= oa +a 


Analog ergibt sich aus (3) unter Benutzung der Gleichungen (1) und (2) 








BB i”... * hd hd? 
31d +ayp* 20+dp *~ 31a +ap* 20 +ay* 30 +a)’ 
coe 
8304+)" 


Setzt man allgemein fiir n= 1, 2, 3, ... 
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so wird aus der erzeugenden Funktion der Formel I (§ 3) 





FC z) oa enya hy—hs— ~ ehyzthgztths+ : 
h{ dz d*z* dé 2* 
2 oie Poot 
wegen hyz +he2z* +hg2? +... dlit+d sasatsasapt} 


=-Gin(1-;), 


f= a ” (1-753) eam (-;5) 
h 


die Funktion 


h _& 
4(1-,%,) MERC (5). 


Andererseits ist bei der Eggenbergerschen Verteilung, wie Eggenberger nach- 
gewiesen hat*, die erzeugende Funktion 


f(2)=(1+d) 








h 
- <r -5 2 h(h+d)...(h+(r—1)d) _) 
z)= > P,z’=(1+d afy > - 2} 
i) r=0 ( \ nce 2 r!(1+dy ) 
h 
-={ S + (-5)(-3 (-3 \ a 
= d > _— _-— “in —_-a .t- 
Cre tht £5 3) ( g—1)--(-g-"+1) ( 1+d) | 
h 
-= ( = 7) ( dz y 
= d = (-— =) (-=—S} }. 
allie i+ = ( d),\” 14d) J’ 
. ] ‘ h 
fii “2 eek. an d pn 
f(Q=(1+a) (1 peal (6). 
Hieraus folgt wegen (5), dass die erzeugenden Funktionen identisch sind, d. h. 
dass f(z) =f (2), 
oder > P,2= > P,2z* ist 
7=0 r=0 
Setzt man z= 0, so folgt Py =P, 
Ferner ist fiir k = 1, 2, 3, ... 
a* we) 7h . 
act (= Ps) = a5 (= Ps"); 


setzt man wieder z= 0, so erhilt man die Gleichungen 
P,=P, (E = 1, 2, 3, ...). 

Sind also die erzeugenden Funktionen zweier Verteilungen identisch, so stim- 

men beide Verteilungen iiberein. Damit ist bewiesen, dass die Eggenbergersche 

Formel ein Spezialfall der Formel I ist. Man erhalt die Formel von Eggenberger 


wenn man bei der Forme! I 


hd? 
= n=1, 2 3, ...) 
"~n(1+d)" ‘ 
setzt. 
* Die Wahrscheinlichkeitsansteckung, Bern, 1924. 


Biometrika xxv1 








114 Die Statistik der seltenen Ereignisse 


§ 5. Spezialfille der Formel I. Die Formel I ist zunachst nicht zu verwenden, 
da unendlich viele Parameter hj, he, hs, ... bestimmt werden miissen. Es lassen 
sich aber spezielle Annahmen iiber diese Gréssen machen, sodass nur eine endliche 
Anzahl von Parametern zu berechnen ist. 


Die Poissonsche Formel erhalt man, wenn man 
he =hg=ha= ... =O 
setzt; denn dann ist h=m 


em . m 


und P= : (r=0, 1, 2, ...). 


' 





Man kann auch annehmen, dass iy und hg von Null verschieden, dagegen 
hs=hy=... =0 ist, d. h. dass die Ereignisse entweder isoliert oder zu Paaren 
auftreten kénnen, jedoch nicht mehr zu dreien, vieren usw. Unter dieser Voraus- 
setzung ergibt sich die Formel Ia: 

s hythg”2 


P, a enya he 2 ae t 
¥,+2y,=7 V3! ve! 
Dabei ist hy+2hg=m, hy + 4he= pe, 
also hy =%m— Pe, he = SS 7 


Die Parameter lassen sich auf eine recht einfache Weise berechnen. 


Analog erhalt man unter der Voraussetzung, dass héchstens drei zusammenge- 
hérige Ereignisse beobachtet werden, 


hg=hs=h= ee O 
Die entsprechende Formel Id lautet : 
P ae em hy-ha—hs s hy"thg”2hg"s 
wah 1 = ' ' <é 
V,+2vg+3r,=r Y1- Ve+ V3: 
Dabei ist hy+2hg+ 3hg=m, 


hy + Ahe a 9hs = pe, 
hy + She + 27hs = Ms, 


woraus folgt hy=38m — » be » e, 
3 
ha= —g™t 2a —F, 


m be Ms 
jee Sy Be. 
me 2 6 

§ 6. Die Formel II, eine Erweiterung der Eggenbergerschen Formel. Man 
erhiilt, wie in § 4 gezeigt worden ist, die Eggenbergersche Formel aus Forinel I, 
wenn man die /y, he, hs, ... in der folgenden Progression abnehmen lisst : 


2 3 n—1 
hi, hy = 2 he = ME os a a a ae 


Eis 3.” 4 af n 




















Rotr LiprErs 115 





wog=, q ist. Nun hat die Formel von Eggenberger aber nur zwei Parameter 


(h und d). Um eine Formel mit drei Parametern zu bekommen, liegt es nahe, die 
hy, he, hg, ... in der Weise zu spezialisieren, dass man nach Festsetzung von hy 
und he erst die hg, ha, hs, ... in entsprechender Progression abnehmen lasst: 


‘ 2 n—2 
hy, h,=£, hg =", hy = PE, ery h, = Pt, 
Nun ist m = hy + 2he+ 3hs + 4hat..., 
also erhalt man m=h+pQ+qt¢@t...j=hit =: ; 
Ferner ist fe = hy + 4he + Dhg t+ 16hg+ ... 
=hy+ p(24+ 3q + 497+ 59+ ...). 
| 1 
Es ist jedoch 2+ 39 + 497+... =— + ——_,. 
Ss J q Y 1-q (l-q)? 
Daher i =h,+ ( S lee 
aher ist fa=h+p er aap) 
Endlich gilt Bs = hy + She + 27hs KG 64h4 + 125hs ae 


=h, + p(4+ 9q + 169? + 259° + ...). 


1 1 1 
Nun ist aber 4+9 24 259% +... =_— oe + ~~ 
Nun ist aber 4+ 9q + 169? + 259% + i-<*G-@ ‘a—-e 
Man hat also das Gleichungssystem : 


m=hy+ EERE (1), 
1-q 
1 1 
fea=hy+p Ga nt Serer errr errr reer er rrrrerrrr) (2), 
h+p(7~0+q-osta—) (3) 
=h : eee © q 
Pe Fine Geer a= 
5 aaale me SEs 
Hieraus folgt i He — Mm, 
2p a 
(1 gq Be; 
durch Division erhalt man l-q= — 
5 M3 — Be 
ferner p =(#2—m) (1 —q)*, 
und h=m- : 
1-q 


Zur Berechnung der P, hat man zuniichst 


Po = ea hy hg hs— 
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auszuwerten. Nun ist 
2 


hy that hst..=nn+b+ety Pry... 
hy ade lon Cae 4 ) 
=hy P+A(o stetgt 


PP 
~ -h+—+5,m(1- q) 
Also wird Po=e q¢ 


Pp 


ee reas 


ee 
=(1—q)!’.e “a, 


hy"thg’2hg”s... 
und man erhalt P,=P, > Sr Ee 
yy +2vot+8v,+...=r V1: Ve: Vgi... 


wobei man fiir die h;, he, hg,... die zu Beginn des Paragraphen angegebenen Werte 
einzusetzen hat. 


Die hergeleitete Formel (im folgenden als Formel II bezeichnet) hat gegeniiber 
der Eggenbergerschen Formel den Vorzug, dass die P, von drei Parametern, niim- 
lich h, p, g, abhangen. Sie hat allerdings den Nachteil, dass zur Berechnuny der 
Parameter auch das Mittel dritter Ordnung ys gebraucht wird, das bekanntlich mit 
einem ziemlich hohen wahrscheinlichen Fehler behaftet ist. 


Fiir p=hig 
erhalt man wieder die Eggenbergersche Formel; es ist dann 
h 
ne 1+d’ 
se d 
114d’ 
: _p. ” ial ~ hd»-* a: 9: 
also h,= ,. Sathae (n=1, 2, 3,...). 


Die Formel II stellt daher zugleich eine Verallgemeinerung der Formel von 


Eggenberger dar. 


§7. Modifizierte Mittelwerte. Die betrachteten Formeln, vor allem die Formel 
II, haben den Nachteil, dass zur Berechnung der Parameter neben dem Mittelwert 
m noch die héheren Mittel w2 und vs bendtigt werden. Nun ist bekanntiich der 
mittlere Fehler von pe 

[nt +m 
Eus =) aa 


15m? + 24m? + m 
der von ps eu, = 7 
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wo WN die Zahl der beobachteten Zeiteinheiten ist. (Diese Ausdriicke gelten, streng 
genommen, nur im Falle der Poissonschen Verteilung; sie sind aber auch bei 


anderen Verteilungen anwendbar, wenn es nur auf die Gréssenordnung der Fehler 
ankommt.) 


Fiir gréssere Werte von m sind die obigen mittleren Fehler offenbar recht hoch. 
Die Parameter der Formeln, die aus den Mitteln berechnet werden, sind dann so 
ungenau, dass auf diesem Wege die theoretische Verteilung nicht gefunden werden 
kann. 


Der grosse Fehler der héheren Mittel kommt daher, dass in der Formel 
my = = (r—m> P,, 
r=0 


schon von k=3 an die Faktoren (r—m)* zu gross und deshalb die Summanden, 
mithin auch die Summe, zu gross werden. Dasselbe gilt von den Ausdriicken /’ (1), 
J’ (A) und f””’ (1) des §3. Aus diesem Grunde liegt es nahe, jeden Summanden 
mit einem Faktor zu multiplizieren, der fiir grosse r sehr klein wird. 


Ks soll die folgende Funktion eingefiihrt werden : 
O18) 2, PAOE  o cesesncecerseessse Man useeceuenen (1), 
r=0 


die der in §3 definierten erzeugenden Funktion entspricht. Dabei ist # eine Zahl 
zwischen 0 und 1, Fiir z= 1 erhalt man die folgenden Ausdriicke: 


dat id (2), 
et 1c Meee meee meee (3), 
¢" = Fe RRA cctisicccled ceca ee ee (4), 
$= Er(r— Tees |e ene (5). 


Die drei letzten Ausdriicke werden im folgenden modifizierte Mittelwerte genannt. 
Durch entsprechende Wahl von « (z. B, c=}4) lisst sich erreichen, dass die Summanden 
nicht zu hoch werden und so die mittleren Fehler obiger Ausdriicke gering bleiben. 


Die Formei I lautet nun 


P= Ss 
- wand {7 ot 
vy, +2Qvygt8ryt+...=r Vis Ve: Vg: er. 


hy"rhe’2hg’s .. 
oS gt’ _ 


Also wird ahnlich wie in §3 
d (z) a Qa higahg—.- Hh TZ iga2Z® +hgT3Z54.... 
oder ri (z) ant eh 2), 


wo h (2) = —hy— he— hg — ... + hye + hgv®z? + hgwFe? + 20. 2... eee (7) 
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ist. Fiir z= 1 erhalt man b =e", 


rink: gs Sean et 2S a ae Pe (8), 
Oo IA FD. Gan serene cccigaeess venes a (9), 
Sra bh tM GRD i kick: (10). 

Dabei ist h=—hy—he—hg— ... that hex® + hga? +..., 
ec Dg H Oe cn” ovis na nnacewenncovkaciyeeseon<ayites (11), 
A ee ge” DM Face | Gicneeen isis svnl veda semen denctas dndebuseine (12), 
“iy SO. PSR ORE TERRY A Atlee Dl A nd (13) 

Aus den Gleichungen (8), (9), (10) folgt 

va > aie abe fe es Sele ee (14), 
RO EER es AES om (15), 
ee Eek OR RNa oes (16) 


Beschriinkt man sich auf die Formel I 6, so kann man aus diesen Gleichungen 
die h’, h’’, h’” berechnen und dann aus (11), (12), (18) die Werte fiir hy, he, hs. 


§8. Die Methode der modifizierten Mittelwerte in Anwendung auf Formel II. 
Zur Berechnung der Parameter der Formel II kann man sogar auf die Ausdriicke 
(4) und (5) des vorigen Paragraphen verzichten, indem man drei verschiedene Werte 
fiir 2, namlich a, #2 und a3, benutzt. Durch Einsetzung dieser Werte in (11) ergeben 
sich dann drei Gleichungen zur Berechnung von hy, p und q. 

, 2 

Nun ist wegen hp = f hs =. ha= PY 
h’ = hyx + Qhea* + 8hga*® + 4hga* +... 


> tee 


=h,« + pa + pqau® + pq?at + .. 


a px 
= & {hs + e | ’ 


pe _W 
oder hy + ing a Or eee (1). 


Fiir die rechte Seite dieser Gleichung gilt wegen Gleichung (14) des vorigen Para- 
graphen 


a}. 
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Die drei Gleichungen fiir h,, p und q lauten daher 








1 —_ 
hy + i eee eescccccesesecsesesescesees (3), 
tat 
Bat da cessseseeeeeeeetttttnnnnnnnneens (4), 
hy + (- Se eee (5) 
— gx3 
Aus (3) und (4) folgt 
an OO x. EIS TPES 
= qa pS qae dy ds, 
d. h. P (a —_ 22) = (d; co dz) (1 — qa1) (1 =e q&2). 


Analog ergibt sich aus (4) und (5) 
p (2 — #3) = (dz — ds) (1 — gre) (1 — gars). 
Durch Division erhalt man 
1— qv1 = Ty — X2 ds — ds 
l—ga3 «—#x3 d—de 
Aus dieser Gleichung lasst sich q leicht berechnen, nachdem die Werte der 
d,, dz, dg entsprechend der Gleichung (2) festgestellt worden sind. 


Il. Tern. ANWENDUNGEN. 


§ 9. Vorbemerkungen zu den Beispielen. Die im ersten Teil besprochenen 
Formeln werden im folgenden Teil auf verschiedene Gebiete der Statistik der 
seltenen Ereignisse angewandt. 


Zur Berechnung der Parameter der Formeln werden benutzt: 


° Ee 
der Mittelwert m= = eos bansiyeetGeentes bul enh wanistaepn ene (1), 
2 (r—mPP : 
das Mittel zweiter Ordnung pe= = = be wc ok vba inch cwebeess aeeaem (2), 
ros) 8 P 
und das Mittel dritter Ordnung ps= > ihe. Kl (3). 


Dabei ist N die Anzahl der Zeitintervalle, iiber die sich die Beobachtung ausdehnt, 
und P, die Anzahl der Zeitintervalle, in denen r Fille beobachtet worden sind, 
sodass 


= . 
> P,=N ist. 


r=0 


Zu bemerken ist hier, dass der wahrscheinlichste Wert des Mittels zweiter 


Ordnung 
S (r—mPP, 
r=0 N- 1 












: 
i 
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ist. Gleichwohl ist das Mittel zweiter Ordnung stets nach Gleichung (2) berechnet 
worden. Der Fehler kann fiir grosse NV vernachlissigt werden. Bei dieser Methode 
vereinfacht sich die Rechnung durch folgende Gleichungen: 
_ 3 (r—m+ 8¥P, 
” a * N 
i) ees 3 
_ = (amt oP, 


r=0 


— 82 


ps — 33, — 3 


Hier ist 5 eine Zahl zwischen 0 und 1, die so gewiihlt ist, dass m—6 eine ganze 
Zahl wird (m —6 ist ein “vorliufiger Mittelwert,” nimlich die dem Mittelwert m 
naichstkommende ganze Zahl). 

Was die Genauigkeit anbelangt, so ist der mittlere Fehler des Mittelwertes m 
= 
N > 


Em = 


der des Mittels zweiter Ordnung 


oo +m 
= a/ —az 
#2 N od 


und der des Mittels dritter Ordnung 


15m? + 24m? + m 
Cn, = V f 


(Betreffs der beiden letzten Ausdriicke gilt das in § 7 Gesagte.) 








Um die Giite der Anpassung der theoretischen Hiufigkeiten an die beobachteten 
zu priifen, ist das Pearsonsche x?-Kriterium* benutzt worden. Aus der Grisse 


2_y(P,—P,) 

oe, GaN 

wo die P, die beobachteten, die P, die theoretischen Haufigkeiten sind, erhalt 
man mit Hilfe der Pearsonschen Tabelle die Wahrscheinlichkeit P dafiir, dass die 
Abweichungen von den theoretischen Werten ebenso gross oder grisser als beo- 
bachtet sind. (Die Hiaufigkeiten fiir grissere r sind bei der Anwendung des 
Kriteriums zusammenzufassen.) Je niher P an 1 liegt, um so besser ist die 
Anpassung der Theorie an die Beobachtung. 


x 


Grundsiitzlich ist noch zu bemerken, dass eine Formel sich um so besser der 
Beobachtung anpasst, je mehr Parameter sie besitzt. Das gilt insbesondere bei 
Verteilungen mit sehr kleinen Mittelwerten. In solchen Fallen ist die gute 
Anpassung einer Formel mit drei Parametern noch kein Beweis dafiir, dass 


das Schema der Formel (etwa die Ansteckung) den tatsiichlichen Verhiltnissen 
entspricht. 


Die in den folgenden Tabellen angefiihrten theoretischen Hiiufigkeiten sind 
im allgemeinen auf eine Stelle nach dem Komma abgerundet. Dadurch ist es zu 


* Pearson, Tables for Statisticians and Biometricians, Cambridge, 1914, Table XII. 
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erklaren, dass die Summe dieser Zahlen in einigen Fallen von der Summe NW der 


beobachteten Haufigkeiten um eine oder zwei Einheiten der ersten Dezimale 
abweicht. 


$10. Todesfille durch Eisen- und Strassenbahn im Saargebiet. Das Statistische 
Amt des Saargebietes veriffentlicht die monatliche Anzahl der durch Eisen- und 
Strassenbahn verursachten Todesfille*. Es sollen die Monate Januar 1925 bis 
Dezember 1929, also insgesamt 60 Monate, und zwar nur die Todesfalle von ménn- 
lichen Personen untersucht werden. Tabelle I gibt in Spalte II die Anzahl der 
Monate an, in denen die in Spalte I angegebene Zahl von Todesfillen vorgekommen 
ist, also die beobachtete Haufigkeit der Todesfalle. Der Mittelwert betrigt m=1-383, 
d. h. es sind monatlich im Durchschnitt 1383 Todesfalle eingetreten. 


Nach der Formel von Poisson ist die Wahrscheinlichkeit dafiir, dass genau r 
Todesfalle waihrend eines Monats vorkommen, 


also die theoretische Hiiufigkeit, da 60 Monate beobachtet worden sind, 
o* am? 

rl 
Spalte III der Tabelle I enthalt die auf diese Weise berechneten Haufigkeits- 
zahlen. 


P,=60 (r=0, 1, 2, ...). 


Um die Giite der Anpassung an die Beobachtung zu priifen, hat man nach 
dem Pearsonschen x?-Kriterium die Grisse 
ae Pp \2 
eo NS (P, — P,) a 
ee 
zu berechnen, Dabei sind die Haufigkeiten von r=4 ab zusammengefasst; die 
Anzahl der Summanden ist demnach n’=5. Auf Grund der Pearsonschen Tabelle 
besteht dann die Wahrscheinlichkeit P= 0°45 dafiir, dass die Abweichungen von 
der theoretischen Verteilung mindestens ebenso gross wie beobachtet sind. 


3:72 


Bei der Poissonschen Formel miisste 
mM = le = Ms 
sein, hier ist aber m=1°383, pe= 2036, pws=3'585. 
Unter Beriicksichtigung der mittleren Fehler 
€Em=0'18, €,,=0°29, e,, = 12, 


diirften die Differenzen zwischen den Mitteln zu gross sein, als dass man sie noch 
als zufiillig ansehen kann. 


Was nun die Eggenbergersche Formel angeht, so ist 
h=m= 1383, 
1+d="=1472, also d=0472. 


* Bericht des Statistischen Amtes des Saargebietes, Saarbriicken; 5, Heft (1927), bis 9. Heft (1929). 
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h 
Man erhiilt Py =60(1+d)~ a=1931 | 





und die weiteren in Spalte IV angegebenen Hiufigkeitszahlen. Die Ubereinstim- 
mung mit der Beobachtung ist hier besser als bei der Poissonschen Formel. Denn ? 
nach dem Pearsonschen y?-Kriterium ist wegen y?= 0°97 die Wahrscheinlichkeit 
dafiir, dass die Abweichungen mindestens ebenso gross wie beobachtet sind, P= 0°91. 


In dem vorliegenden Beispiel besteht Ansteckung in dem Sinne, dass durch 
einen Unfall auch zwei oder mehr Personen in Mitleidenschaft gezogen werden 
kénnen. Nimmt man eine maximale Zahl von drei Toten bei einem Unfall an, so 
hat man die Formel Ib anzuwenden. Dann ist 


5 
hy =3m—-— 3 Me+ e = 0°852, 


ha=— m+ 2, —“* = 0-205, 


Man erhilt Po = 60. e4- "2's = 20°01 


und die weiteren Zahlen der Spalte V. Diese Werte stimmen ausgezeichnet mit 
den empirischen iiberein. Auf Grund des Pearsonschen y?-Kriteriums ist x? = 0°65 
und P=0°96. 


Zur Kontrolle sei noch die Formel IT benutzt. Nach dieser Formel ist 
1—-g=2. 4" =0848, g=0157; 
q iia = pis q ‘ 
ferner ist p=(u2—m)(1—q)? = 0464. 


Es wird h=m— a = 0'833, 


2 
ha= os = 0°003 usw. 


Bedenkt man, dass insbesondere ys; mit einem recht hohen mittleren Fehler 
behaftet ist, so sind die Abweichungen der obigen Werte der hi, he, hg von den nach 
Formel Ib berechneten als gering anzusehen. hg ist fast gleich Null, womit die oben 
gemachte Annahme iiber die Héchstzah] der Toten bei einem Unfall sich als 
berechtigt erwiesen hat. Die aus dieser Formel zu berechnende Verteilung wiirde 
sich daher nur unwesentlich von der Verteilung in Spalte V unterscheiden. Nach 
Formel II verlaufen von allen tédlichen Unfillen durch Eisen- oder Strassenbahn 
rd. 76% mit einem Todesfall, 21 °/ mit zwei Todesfiillen und 2°/ mit drei Todes- 
fillen. Eine Nachpriifung dieser Daten durch die Beobachtung ist auf Grund des 
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zitierten Berichtes nicht méglich, da Angaben iiber die Zahl der durch die gleiche 
Ursache Verungliickten fehlen. 

Die Eggenbergersche Formel stimmt hier nicht so gut mit der Beobachtung 
iiberein wie die Formel 1b. Das kommt daher, dass nach der Eggenbergerschen 
Formel 





h 
ha = 7g = 0940, 
1a 
he = 3 Teq7 = 0180, 


3 
koe (5) hy = 0-008 


ist. Man sieht, dass diese Zahlen von den entsprechenden der Formel Ib oder IT 
abweichen, dass hier insbesondere hy; zu gross und he zu klein ist. 


Die gute Ubereinstimmung zwischen Theorie und Beobachtung ist hier vor allem 
dadurch begriindet, dass das Material homogen ist. Es betriigt niimlich 








im Jah | die Zahl der 
im Jahre | Todesfille 
——_——_——— I 
1925 | 12 
1926 19 
1927 15 
1928 18 
1929 | 19 





Die Schwankungen von Jahr zu Jahr sind also nicht zu gross. 


TABELLE I. 


Verteilung der mdnnlichen Todesfille durch Eisen- und Strassenbahn im Saargebiet. 

















| 
z Il. IIT. IV. V. 
Anzahl Beobacht, | Verteil. nach | Verteil. nach | Verteil. nach 

der Fille Verteil. Poisson Eggenberger | Formel I} 

0 20 151 19°3 20°0 

1 17 20°8 18°i | 17°71 

2 11 14-4 4 | ona 

3 8 6°6 6°0 } 6"4 

4 2 2-3 29 = | 3-0 

5 - O°6 1:3 13 

6 2 01 0°5 0-5 

7 — 0-0 03 0:3 
und mehr 
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§11. Verteilung der weiblichen Diphteriefalle in Béhmen 1921 bis 1929. In 
einem kiirzlich erschienenen Artikel der Aktudrské Vedy gibt M. Vacek bei der 
Untersuchung der Eggenbergerschen Formel als Beispiel u. a. die weiblichen 
Diphterie-Falle in Béhmen fiir die 108 Monate der Jahre 1921 bis 1929 an*. 


Spalte II der folgenden Tabelle II zeigt die beobachtete, Spalte III die von 
Vacek auf Grund der Eggenbergerschen Formel berechnete Hiiufigkeit. Dabei sind 


der Mittelwert 
m=h=15°8796 


und die Grésse d = 13°6274 


zugrundegelegt worden. Was die Ubereinstimmung mit der beobachteten Verteilung 
angeht, so ist 
x7 = 24°4 
also P=0°08, 
d.h. die Anpassung der Eggenbergerschen Formel ist hier sehr schlecht. 

Das Eggenbergersche Schema ist nicht geeignet, entweder weil das Mittel 
zweiter Ordnung, aus dem d berechnet worden ist, mit einem zu hohen Fehler 
behaftet ist, oder weil die speziellen Voraussetzungen, die tiber die Art der 
Ansteckung zu machen sind, hier nicht zutreffen. Um beide Fehler miéglichst 
auszuschalten, soll die Formel II benutzt werden, wobei die Parameter durch die 
Methode der modifizierten Mittelwerte berechnet werden sollen. Dadurch wird der 
hohe Fehler durch das Mittel dritter Ordnung vermieden. 

Setzt man a=1, 

log a=—O1+, dh. v2=0°'7943, 
logas=—02, dh. 23=0°6310, 
so erhilt man entsprechend der Gleichung (2) des § 8 
d,=15'880, d,=6401, d,=5'254. 
Aus der Gleichung (6) des § 8 wird hier 
1-q 1594 ; ~~ =0-75 
i— 06319 01524, also g=0'9378, p=0°7315, 
hy =4124, ho=:0°366, hg=0°229, hy=0°161 usw. 
Die h; nehmen nur sehr langsam ab, was auf eine starke Ansteckung hindeutet. 
Man erhiilt 
P —~hn+e 
Po = 108 (1 — q)ve q=(0378, 


und die weiteren in Spalte IV angegebenen Zahlen. 
Das Pearsonsche y*-Kriterium liefert hier wegen x? = 8°9 
P=09, 


* “Sur la loi de Pélya régissant les faits corrélatifs,” Akt. Védy, m1. 18, 49, 1932. 
+ x, und z, sind so gewihlt worden, dass ihre Logarithmen auf volle Zehntel lauten, dass sich also 
ihre Potenzen leicht berechnen lassen. 
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d.h. die Anpassung an die Beobachtung ist als befriedigend anzusehen. Gleichwohl 
fallt auf, dass auch hier die theoretische maximale Haufigkeit hinter der beobach- 
teten zuriickbleibt. 

Bei ansteckenden Krankheiten scheint die Verwendung von drei Parametern 
unter Umgehung der héheren Mittel zu brauchbaren Resultaten zu fiihren. Das gilt 
vor allem fiir Verteilungen mit einem grossen Mittelweit wie in dem vorliegenden 
Fall, wo m= 15°8796 ist. 


TABELLE II. 
Verteilung der weiblichen Diphteriefalle in Bohmen 1921 bis 1929. 








I. Il. Ill. IV. 
Anzahl Beobacht. Verteil. nach | Verteil. nach 
der Fille Verteil. Eggenberger Formel II 
0 1 4°7 0-4 
1 1 5°] 1°6 
2 3 5-2 3-4 
3 4 5°1 5°1 
4 7 5-0 62 
5 6 4°8 6°5 
6 9 4°6 6°4 
7 10 4°4 59 
8 7 4°1 5°5 
9 5 3°9 50 
10 3 3°7 4°5 
11 4 3°5 4-2 
12 1 3°3 38 
13 3 371 3°5 
14 3 3°0 3°2 
15 2 2°8 3°0 
16 39 41°8 39°8 
und mehr 




















§ 12. Verteilung der Blutkérperchen. In einem 1907 erschienenen Artikel der 
Biometrika* wird die Poissonsche Formel benutzt, um die Verteilung der 
Blutkérperchen auf die 400 Quadrate eines Hamacytometers zu untersuchen. 
Falls die Blutkérperchen unabhingig von einander auf die Flache verteilt sind, 
ist auch hier die Poissonsche Formel giiltig. 

Der Verfasser des erwihnten Artikels hat vier verschiedene Konzentrationen 

untersucht. In der ersten Konzentration ist die mittlere Anzahl der Blutkérperchen 
oro Quadrat 
' m = 0°6825. 
Tabelle IIT enthalt in Spalte II die Zahl der Quadrate, in denen die in Spalte I 
angegebene Anzahl von Blutkérperchen beobachtet worden ist. “Student” hat die 
Zahlen der Poissonschen Formel berechnet (Spalte IIT). 

Die héheren Mittel werden angegebeu als 

a= 08117, Ks = 10876. 


* “Student,” “On the Error of Counting with a Haemacytometer,” Biometrika, Vol. v. p. 351, 1907. 
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Da die mittleren Fehler 
Em =0°045, €,,=0°064, ¢,,= 0°20 
sind, kann man die Abweichungen der héheren Mittel von m nicht als zufallig 
ansehen. Ausdiesem Grunde sind die Differenzen zwischen Theorie und Beobachtung 
recht gross. Der Verfasser gibt nimlich an 
x7=9'92 und P=0°04, 
wobei die Hiufigkeiten fiir 4 und mehr Blutkérperchen zusammengefasst sind. 


Zur Erklarung wird angenommen, dass die Blutkérperchen das Bestreben haben, 
zusammenzuhaften (“...there is also probably a tendency to stick together in groups 
which was not altogether abolished even by vigorous shaking”). Wenn diese 
Annahme richtig ist, muss Forme] II mit der Beobachtung besser iibereinstimmen. 
Man erhiilt 

1—q=0°9366; q=00634, p=0113; 
hy=0°562, hg=0°057, hg=0°0024, hy=0:0001 usw. 
Die Wahrscheinlichkeit fiir Gruppen von 4 oder mehr Blutkérperchen ist also 
verschwindend klein. Spalte IV der Tabelle erhilt die nach der Formel II 
berechneten Werte, die hier auf ganze Zahlen abgerundet sind. Was die Giite der 
Anpassung angeht, so ist 
x? =3'33, dh. P=051. 

Diese weit bessere Anpassung an die Erfahrung spricht fiir die erwahnte 


Gruppenbildung. 
Formel Ib liefert die Konstanten 
hy=0°562, he=0°056, hs=0-0029, 
die fast identisch mit denen der Formel IT sind. 


Die hier geschilderten Verhiltnisse treffen nicht fiir alle untersuchten Kon- 
g 


zentrationen zu. In einigen Fallen passt sich namlich die Poissonsche Formel der 


Beobachtung gut an. 


TABELLE III. 
Verteilung der Blutkiérperchen auf 400 Quadrate eines Hdémacytometers. 

















I. I. III. IV. 
Stathteorechen Meu” —— nach ‘oe 

pro Quadrat ertell. oisson orme 

0 213 202 215 

1 128 138 121 

2 37 47 46 

3 18 11 14 

. 3 2 3 

5 1 0 1 

und mehr 











—— 
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§ 13. Zahlen der Blumenkronblitter von Ranunculus bulbosus. Zur Tlustration 
der Poisson-Charlierschen B-Kurve sind von Riebesell* die Zahlen der Blumen- 
kronblatter von Ranunculus bulbosus benutzt worden. Die mittlere Anzahl der 
Blumenkronblatter fiir die 222 untersuchten Exemplare ist 5°631, das Streuungs- 
quadrat 

po = 0-918. 
Spalte II der Tabelle IV zeigt die beobachtete Haufigkeit der in Spalte I ange- 
gebenen Zahlen x der Blumenkronblatter. 


In dem zitierten Artikel wird zunichst der Wert z= 5 als “ Nullpunkt” der 
Verteilung gewihlt, d. h. es ist 
m = 0°631. 


Dann werden aus der Charlierschen Formel 


P,= 222 f(r) +0. A (r)}, 








, om" 
wo ¥(r)= ~ 
Me— ™m 
(iS, 


A®y (r) = (r) — 2 (r —1) + (7 — 2) 
sind, die in Spalte IV angefiihrten theoretischen Werte berechnet, wobei r=a —5 
ist. Spalte III gibt zum Vergleich die Poissonschen Zahlen an. 
Nun ist aber Hg = 1°644. 
Die mittleren Fehler sind 
Em = 0°064, &,=0°080, ¢€,,=0-25. 


Die Abweichungen der Mittel von einander sind also grésser, als die mittleren 
Fehler zulassen. 


Die Formel IT liefert die folgenden Zahlen : 
1—q=0°7906; g=0°2094, p=01794, 
hy =0°4037, ho=0°0897, Ag=0°0125, hy=0°00197 usw. 
Man erhilt die theoretische Hiufigkeit fiir das Vorkommen von 5 Blumenkron- 
blattern : 
S =mee _ 
Po=222(1-—q).e ~ 4=1335, 
und die weiteren in Spalte V angegebenen Zahlen. 
Um die Giite der Anpassung zu priifen, seien die Fille von 9 und 10 Blumen- 
kronblattern zusammengefasst. Fiir die Poissonsche Formel ist 
x? = 185, 
d. h. wegen n’ = 5, P=0°001, 


* Riebesell, ‘‘ Biometrik und Variationsstatistik,”” Handbuch der biologischen Arbeitsmethoden, Abt. v. 
Teil 2, 1. Halfte, Wien, 1928, 8S. 759. 
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die Formel ist unbrauchbar. Fiir die Charliersche Verteilung ist 


x? =0°99, d.h. P=0°91. 
Fiir die Formel II ist sogar 
x?=025 und P=0°99; 


diese Formel passt sich am besten der Erfahrung an, wobei jedoch zu beachten ist, 
dass drei Parameter zur Berechnung gebraucht worden sind. Es scheint, als ob 
sich statt urspriinglich eines Blumenkronblattes zwei oder mehr entwickeln kénnen 
(vielleicht durch Zellteilung) und so das Schema der Formel II giiltig ist. 


TABELLE IV. 


Zahlen der Blumenkronbldtter von Ranunculus bulbosus. 








i, Il. III. IV. vs 
Zahl der Hiiufigkeit Hiufigkeit Hiufigkeit 
Blumenkron- a. nach nach Char- nach 
blatter x “ero Poisson liers B-Kurve | Formel II 








5 133 118°2 134°9 133°5 
6 55 74°5 51°6 53°9 
7 23 23°5 22°5 22°9 
8 7 4°9 9°5 8-0 
9 2 ohts) 2°9 2°6 
10 2 Ol 0-6 Ors 























Vorstehende Arbeit ist durch eine Anregung von Herrn Prof. Dr. Riebesell, 
Hamburg, entstanden. Ich bin Herrn Prof. Riebesell dariiber hinaus fiir wichtige 
Hinweise, insbesondere auf die benutzte Literatur, zu Dank verpflichtet. 


* De Vries, Berichte der deutschen botanischen Geselischaft, Jahrg. 12, pp. 203—204, 1894. 

















ON CERTAIN NON-NORMAL SYMMETRICAL FREQUENCY 
DISTRIBUTIONS. 


By G. H. HANSMANN, MSc. (Stellenbosch), B.A. (Oxon.). 


INTRODUCTION. 


In 1889 Professor Karl Pearson introduced a series of curves which enabled in 
all cases a mathematical curve to be found which had the same first four moments as 
any given frequency distribution, thus providing for the more usual types of frequency 
an adequate description in mathematical terms of their distribution as judged by 
the test of goodness of fit. He reached these curves by integrating 


1 dy _ —(#+a) 
yx Cotcyetcga*+...6,2"’ 





retaining only the first three terms of the denominator on the right-hand side. 


Beside the need to graduate data, there is another aspect of frequency curves. 
Mathematically—generally with certain limiting hypotheses—the distribution of 
certain frequency constants, such as the correlation coefficient, the regression 
coefficient, the variance, and “Student’s” z from normal distributions, has been found. 
In some of these cases the distributions are exact Pearson curves, in other cases 
these curves suffice when the sample reaches 25—30 individuals. In a further 
series of cases it is possible to determine the theoretical moment coefficients, 
although we cannot find the mathematical form of the curve. Examples are the 
distribution of the third moment and that of the ordinates of the regression line at 
a given abscissa, both from a normal population. Both curves are symmetrical and 
neither curve is exactly described by a Pearson curve with the same first four 
moments. Furthermore, in the case of the distribution of ys, a Bg= 12°5 has been 
found. Here the corresponding Pearson curve may give us a very bad fit, because 
the values* of 84 and of 8g, as determined from the difference equation, are negative 
when 82> 6 and 4°5 < 82 < 6 respectively. 

We now ask: To what extent shall we better matters by taking into account 
higher moments? What are the types of curves to which higher moments lead us 
in such cases? Shall we be able to find a higher order symmetrical Pearson curve 
when f; > 6 and 4°5 < fy < 6, for which 84 and zg are respectively positive ? 

Dr David Heron has already discussed the third order Pearson curves which 
correspond to the differential equation 


1 dy _ = te) 

ydx co + Cr@ + Cgu* + cga*” 
5B 4 358, 

a 6-B, Es (6 “ 8) (9- 282) 


Biometrika xxv1 9 
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but his manuscript* has remained unpublished because the addition of a single cs 
applied to actually occurring frequencies did not as a rule materially improve the 
goodness of fit. 


The aim of the present paper is to include the following term in ¢, but to 
consider only the symmetrical curves, for which a, ¢, cs may be taken to be zero, 


This leads us to the differential equation 


ldy _ dE TR Se Se RR (1). 


ydx Cy + Coa + cya 





It is to the discussion of the integrals of this equation and their applications 
that this paper is devoted. 


I. THE DIFFERENT FouRTH ORDER PEARSON CURVES. 
(a) Discussion of Types. 


Without the loss of generality we can write the differential equation (1) as 








ldy _ alo 
da a af’ 
I= bet ba—a+ha 
o o o 
where the constants by, b. and by are now mere numerics. 
1 dy x 
Therefore ee TTT 2). | 
y da / ( ) \ 


bo o* + bya? + ba oA 
co 


Multiply both sides by ya**+4 and integrate with respect to # between the limits 
a, and ag. Divide by N. 


1 ay a’ a, 
Memes - eh o8 line? + ben? tha nlden | ahi yds 
VJ, ot) “I =H J 


2 
Ja 


Integrate the left-hand side by parts, We find 


1 2 ba ) o 
WV gut |Pao* + box? + al y |" 


1 Ay \ 
= x] \(2s + 1) bo o2.n"8 + (2s + 3) bg a®+? + (2s + 5) =" a y da 


2 
a, 


I I 2548 
S a+ yda, 
N Ja, 

Take the origin of # at the mean of the distribution. Now since the distribution 
is symmetrical the mean and the mode correspond. Hence the limits of integration 


1 pte 
V | a ydax = ws = sth moment about 





are from a, = — a to dg= +a and the integral 


the mean of the distribution. 


* Tam very gratefal to Prof. K. Pearson for drawing my attention to this manuscript. 
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Therefore 
(25 +1) pa Do? + (26+ 3) ba parse + (28 + 5)“ worse Ca — Mans 


Qqrs+1 f 


= WV jos + bea? + = at} Ya cnccetvesccecensenees (3). 


where Gs 





Now c, vanishes if 


(i) @ is zero; but a cannot vanish or there would be no range; 
(ii) a* is a root of bya? + baa? + = =0 and if y, is finite; 
(iii) yq is zero and if (boo? + bea? + = a‘) is zero or remains finite. 


By means of (ii) and (iii) we are able to fix the limits of integration. Put *=uin 
Cc 


the equation (2). We obtain 


1 dy u ! 
a nnn ec cccccecccsccccccscccccces 4). 
ydu by + beu® + baut (4) 
™ ‘ 1 PS an , 
Now write m ==, z=u?=—,. We obtain 
Qh, o 
1 dy my ? 
y dz bo ms be eect oo (5) 
by by s 
r . bo be 2 - 
The expression ( +5 z+ 2") can assume any of the forms 
a Of 


(1) (~it+2)(q +2), real and unequal factors; 
(2) (+2)? + 4, imaginary factors; 
(3) (pi+2)’, equal factors. 


Let us integrate the differential equation (5) in these three cases and consider 
what happens to the expression for ¢q. 


1 
(1) Ldu_ my 


ydz (p~rt+z)(q+2) 
_ ™% ( 1 ie. = ) 
Qa—fa\Pit2 Gat2/ 


m +z 
log y = const. + (~ ~) log e + ‘), 


Integrating, we find 





therefore Y= Y% 


> 


We. 
Pr + 4° ) 





where Y is the constant of integration. 








132 On certain Non-Normal Symmetrical Frequency Distributions 


Notation. I shall use letters with suffix, to represent the quantities m,q, p,kand k’, 
when they can be positive ‘or negative and unsuffixed letters to represent their 
absolute values. Write 


q=lal, P=|pil, m= |m|, 


I shall always take g > p. 
(i) pi and q both positive. Hence writing p =p, and q = q, we find 
Ca = const. x a*+1 (a? + p)ir+t (a? + gi, 


Hence c, = 0 if a? =— p, when we have an imaginary range, or if a?= 0 , provided 
in addition (2s + 1)+ 2 (ky +1) <2 (hk, —1), ie. 2s<—5. Hence c, can only be zero 
for negative moments. 


(ii) p,<Oand q>0. Write p=—p, and g=q. Clearly the equation for y is now 


Rare (2—)" 
¥=Yo qt+u . 





Accordingly, cg= const. x a+! (p — a?)A’+4 (q + a?)*r and becomes zero at a = + vp, 
provided k,’2—1. If ky’ be < —1, cq can never be zero. 


We get a similar discussion for p; >0 and qu < 9. 


(ili) pp <0 and q@<0. Write pp=—p and qa=—g. The equation for y 
reduces to 





2\ k 
g-u 1 m 
d ) , where ky = —?-. 

q-Pp 


We now find ¢q = const. x [p — a®}/-4[q — a®}*+4 a?" and accordingly c, = 0 at 
a= + \/p, provided k,< 1, but can never be = 0 when &, 2 1. 


Y= Yo (= u* 


1 dy Qmyu 


(2) y du (4 by +o" 


Integrating with respect to u® we obtain 


m b, + u? 
log y = const. + —*tan-! 7 : 
2 Cy Cy 
my enn b, . u2 
therefore Y = ye be 
b 2 
= tan! 3 - 
Hence Ca = const. x a%+1 e% “t [(a? + by)? + ¢,?]. 


Clearly ¢, can only be zero if a=+ 0 and if in addition we have the condition 
(2s +5) <0 or s< —§. In other words, for positive moments ¢c, cannot be zero. 
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9 
(3) Ldy_ — 2mu_ ; 
ydu (m+u?P 
my 
therefore log y = const. — rae - 
BE 
therefore y=yoe Bt”, 
_ ™% 
Accordingly Ca = const. x a*+e + (p, + a2), 
Hence c,=0 at a? =— p,; provided m, > 0, but can never tend to zero as a?@>+ » 


m, 


when s >0. In other words, of all the possible forms of ye ” 1+ ‘the ouly equation 
which can represent a real frequency distribution is 


4 m : 
y=yoe ?™, 
where we must take the range for u from — /p to + /p. This solution correspends 


to m, < 0, > > 0 and “a < 0 in the differential equation (5). 
4 4 


In the present paper I shall confine myself only to those equations which 
satisfy the conditon cy = 0. 


Consider the equation 


y= Yo fas =) 
where Pp =|pr, 

q=|al, 

m=|ml, 

B= |b'|=2 a 


I shall take the limits of the range, viz. a, to be defined by the relation c, =0, 
and hence in the same way as above we find a=+,/q. The corresponding 
differential equation is 

ldy 2mu 


Compare this equation with the Equation (5). Hence 


my =m, and therefore m, > 0, 
bo bo 
b, = pq, "i ra a > 0, 
ba be 
by = (q _ P ), » » by < 0, 











TABLE IL. 
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which gives us the criterion for this type of curve, which I shall call the Type (i) 
curve. In this way we find the following types of curves and their criterion : 








(ii) 


*(iii) 


*(iv) 


(vii) 








Form of Curve 


Range for u 


Criterion 





b-ratios 


m, 


k or k’ 








_ /9 to 


—,/p to 


—./p to 


—,/q to 


—/p to 


—./p to 


- Jp to 


+./9 


+ Jp 


+./p 


= 5 J 


+./p 


+,/p 


+./p 








| m,<O 


m,>0O 


m,>0 


m>O0 


m<O | 


m, <O 





| 
| 
m <0 | 


k<l 


eat 





K<l 








* The Types (iii), (iv) and (v) are u-shaped curves. 


, ‘ om be\? 
Note that for the Types (i)—(vi) we must also have the condition (3) 
4 


so that p and q may be real. 


+ m=|m,|. 


The general differential equation for the Types (i)—(vi) is 


o-. -° tae 
y du ~~ + (at u)u*+ ut 
Compare this with the Equation (5). 


2myzu 
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1 
Therefore m= ab, 
<= a 
Aan = bs » wees esesescesescscece 
be 
ts al 





Let us express the quantities bo, be and by, aa therefore 7m, pig: and p;+ qi, in 


terms of Bz, 84 and fg. 


Put ca= 0 in (3) and the difference equation reduces to 





senate (6). 


‘ 2 al 
(2s + 1) pesbope + (28 + 3) be peosi2 + (28 + 5) = Mesi4a = — asiz- 


b 
Put s=2: Dope + bapa + 5S ma = — pe, 
by 
s=l1: Sbo 2" a Sbe pte + 7 — Me = — Ma, 
Ke 
2 5 ” ¢ be 
gaz: Sbopapa t+ Tape +9 2 a= — Mee 


Divide the equations above by pe, 2”, ve® respectively. Write | 


Ma Le Bs 
=—3, = and =... 
Be pe Ba 2 Bs 


We obtain bo +3b, +5b482=-1 | 
3bo + 5b282+ Tby8s = — Bo >. 
5bp Bz + The84 + Iba 8, = — al 

Solving, for bo, be and by, we get 


2 {148.° — 9828. — 582" Bs} 


bo = 98. (582 — —9)-—498,2 + 210822; — 12588 
by = _ {9B (3 — Ba) + TBs* — 508284 + 2583") 

{9Be (582 — 9) — 498.2 + 210828, — 1258.3} 
bya 2 {5s — 84 (6 — Be)} 


986 (582 — 9) — 498.? + 210828, — 1258.8 


The solutions are valid, provided that 


A = 98, (582 — 9) — 498," + 2108284 — 125828 


is not zero. 


If A=0, we solve for 


b,’ 6,’ 

bo a 148,° — 9Bs Be Ne 5Bs "Ba 

ba 5B — (6 — Ba) By 

ba _ 986 (3 — Be) + TBP — 5082 84+ 258° 

by 2 {582" — (6 — Ba) Ba} 

1 986(582— 9) — ~—-* + 210828, — 12583% 
2 {5 a —(6—- Ba) Ba} 


and find 
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Since A=0, i.e. 
98, (582 — 9) — 498? + 2108284 — 1258, = 0, 


om bo ae : 
_* 0, but bs and p, remain finite. 
Hence our differential equation reduces to 
+ 
yda 


The integral is now y= constant, or the frequency curve is a rectangle, which 
I shall call the Type (viii) curve. 


The solutions (8) above are valid only if 582?— (6 — B2) 8s #0. If 
5B —(6—B2)84=0 and 98,(582—9) — 4984? + 2108284 — 1258.3 = 0 
5B? _ 25828 (82+ 8) 
6B, and B.=- 9(6—Ba®’ 

5B? — (6 — Be) Ba, 
986 (582 — 9) —49B2 + 210B2 As — 125833, 
1487 — 9828. — 5B? Ba, 
9B. (3 — B2) + 7By? — 5082 8s + 25838 

becomes zero and we cannot find the values of b;, 5, and 4, or the corresponding 


type of frequency distribution from the difference equation. In Section h (8) I shall 
show that the frequency type corresponding to the values of 


are simultaneously true, ie. if By= each of the 


expressions 





5 Be" } 
m6 Ba | 9) 
a Se Sabana aaetes iaeaacaeeS (! 
e"  9(6—Ray 
-~M 
p-x*/o* 


is a special form of the Type (vii) curve, viz. y = yoe 


(b) General Discussion of the Constants of the Curve. 
The following values were found : 


oo bo = _ 2 (98286 — fo) 
1) Gt “ies 2K , 


where fo= 148,? — 58278, and K = 58? — (6 — Bs) Ba, 


; bs = 986 (3 — Be) — fr 
Seg See ; 
where f2 = — (784? — 508284 + 258,°), 


_ 1 _ 986(5Bs—9) -fa 
~ Dd, 4K , 


where f,= 498.7 — 210828, + 12528. 


m 
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I have divided the curves into the different types according to 


(i) the sign of + ’ “2 and m,, hence the sign of piqi, ~i+ 1, and m,; in other 
a O% 


words we want the signs of the expressions 
K = 58? —(6 — B2) Bs 
Fy = 9828 — fo | 
F, = 986 (3 — 2) —Se 
Fy= 986 (582 —9) — fa 
(ii) whether p; and q are equal, real or imaginary; hence we require the 
a 818.2 (3 — Bs)? — 18B4b +c 
(u— pp = eS ee 


where b = (3 — Be) fo— 882K, c=f2? — 16/0. K, and L =784(82—3) + Bo? (19 — 58s). 


On simplification 


bnew sanabacasouewaeee (10); 





(mn -j ye — 81 (B2— 3)? iq. o+Sik vl) _b-—4\K\ VL) 
a- Psst 9G@-7 \(" 96-7 5 


I shall write this equation as 


1) 1) 81 (8 — 3) 

Fea (Fi) (re) 22 
ran nrg OF 4K VE png _ b- 4 Kl VL. 
where F,' = Bs 93 —Ba’ F;''= Be 9G —As)? ° 


(iii) whether k& or k’2 1, hence we require the sign of [m,:—(qi—x)]. The 
zone where ¢a+ 0, where piq:, pit+q, and m, have other values than those we 
found, is bounded by Fs=0 and m,=(qi1—j1). We therefore want to write the 
condition [m,—(q1— p1)] 2 0 in a workable form. To avoid the difficulty of the 
square root and also of sign, I shall consider Fy = m,*— (q1 — p)*. 

On simplification we find : 

ae 2438,7(82—1) (782-15) —188,B+C 
FO di a! cin. soni 











16K? 
_ 243 (Be meh — 15) (Fe'} {Fy}, 
where F;' =Be— 27 5S, cee 15)’ 
Py’ = e- B-—4|K\|VQ 


27 (B2— 1) (782— 15)’ 

B =(582—9) fa— 40, 

C =fP—4e, 

Q =498,? — 1484(3782—30) + 78? (6082 — 53). 
Hence F, = 0 (a = 0, 2, 4) and F,= 0 (y = 6, 8) represent surfaces of revolution in 
the B2, 8s, Bg space such that sections by the plane 82:= constant of F,=0 
represent parabolas and of F,, =0 two intersecting parabolas. 
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The position of any point (B2, 84, 8.) with respect to these surfaces of 
revolution determines the signs of the expressions F,, and F,, and the signs of 
rN, Pit Gu. Mm, (G1—pr)*, and [m?—(q — p.)*]. Hence, theoretically, a model 
of these surfaces of revolution will enable us to determine what type of curve 
corresponds to a given set of values of Bz, Ba and Bg. 


(c) The Transition Types. 


From the previous Section it is clear that when 2, 84 and 8, satisfy F, =0 
(a =0, 2, 4, 6,8) either bg, be or bg vanishes or there is a relation between these b’s, 
and the general differential equation is simplified. To each variation in the form 
of the differential equation there corresponds an integral. Here again some of the 
types so obtained are not admissible because we are confining ourselves to those 
types of frequency curves which satisfy cg=0. The discussion is similar to that 
given in Section (a) and is omitted here for the sake of brevity. The results are 
summarised below in Table IT. 


2 
Notes. (1) When pam Gee bs=0 and the general differential equation 
— Pe 


ldy_ _u 
ydu by + bau? 


reduces to 


This is the equation Professor Pearson integrates to obtain his symmetrical 


2 
types. Let us see if by proceeding to the limit as Ba om we get the same 


Be 
values as he deduces for his curve. 
“he 9Be(8—B2)—fe _ 986 (3 — B2) + TBs — 508 284+ 258." 
*” 9B6(5B2—9)—fa 986(582—9) — 4984? + 21082 B4— 125822’ 








3— 
(6 — =i ve 9Be (6— B2)? - 2583 (Be = 8)} 
therefore lim : by = oh 
Pome, (6— 7 (6— Bo? 2 (98s (6 — 82)? — 2583° (82 + 8)} 


_ 3-8 (644 BH Gx+8)) 


5B2—9’ 9(6 — Bs)? 
Similarl aa 28s 
. 6—B, 


It would seem as if we should have Professor Pearson’s types at every point along 


5 Be” : ok 
the curve 84= ,o no matter what the value of 8. is. But this is not so. When 


B2’ 
Ba= rae b,a=0, be and by can be expressed in terms of fg: and are fixed for 
constant Ba. We get a relation between Bg, 84 and Se, and find 


358% 


Be= (= Bs)(9 283)" 














_ 7 ae ee 
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Main Types 





Transition Types 
















































































TABLE II. Table showing the various T’ of Frequency Curves. 
g Ypes equency 
Limits Criterion 
Type Equation of Curve | of the Differential Equation . ; Remarks 
Range m | p91) %1+Pi | (G1 ~Ps)* | mP- (G1 —Pi?? 
: __fe-w 1, | 1 dy Qmu < J 
(i) Y=Yo 4 +./9 y du =@-p) wae +i = + + or 
= _. [p-w} F ldy_ Qmu 
(ii) I=} G+u2 ENP | y du —pq+q—p) @+u! pweut| +] -| + - +or- 
ee __ fe-wf 2 xa sae : = 
(iit) | y=n| S| =p ydu pa—(qt+p)v+ut © ita es 
, } | 
; Q ptut) F 1 dy _ —2Qmu Pe = a 
(iv) y=Jo a4 =Nq ydu —pq—-(q-p)w+ut " 
W 1 dy — 2mu 
Fae = / a a a ne =: 
(v) y= 4 | vP ydu —pqt+(q-p) @+u8 | + + 
: _. [p-# | 1 2y _ — 2mu ~~ i ts 
(vi) Y= Gg Al +,/p y du pa—-(qtp)e@+u' 7 + or 
ldy  —2mu : Bs, By and B, 
(vii) | y=ye ? * EVP | y du p?—2p+u! Bal ot By oo eee satisfy F,=0 
i 1 dy For all valves of p, and g, provided p; 9, p1 +g: and m, are 
(vill) ¥=Yy.=const y du sale not simultaneously zero. 8, 8, and Bs satisfy F,=0 or 
simultaneously F',=0 and F,=0, F,=0 or F,=0, F,=0 
ix) y = | 2 1 + ke oo Oo | i = , 
(ix) Y= pre ; VPlydu —p+u +i = .-* | bth as Bs, By and B, 
a 7 ~ < 
7 pt+u* | rn, | Lay _ —2mn PM ss | ¥ - satisfy 
(x) y-9| Ore id =P ydu  —p*+ut 0 Lite | F.= 
9 m ‘ 
ae Me g-u |— a 1 dy _ +2mu * 
(xi) y=Yo [ u2 ] ; vg ydu —qut?+u! dl + oy | 82, B, and By 
sn) u2 3 + Jq | 1 _—2mu = f° x= + ny f pe | 
(xii) ¥ % F ee ue Vqd y du —qut ws | 0 
—u? 2 
sii) | y=y |S +9 + (p+9) - a; +i - - | + | 0 
ptw ydu —pq—w(q—p)t+u 
; | pw me i dys 2(ptaq)u | Bs, — Be 
xiv) I=Y%o qu tp ydu = —qp+(q—-p)v+u pie Gas | x vie 0 "0 
t ldy  -2 (q—-p)u { 
"hae =Yo ! 
aide Gite: [ie] VP | ydu~pg—(p+gwew |-]}+| - | + | 0 
f a Gee 1 dy _ —2Qmu | ‘ 
(xvi) | Y=Y% (p?- wu)” tp ydu pw mr 58° Type‘II-curve 
ae rae ee ldy _ —2mu * 6-2, r 
(xvii) Y=Y (p?+u?) +a 7 du" paw ‘ | 358.8 Type VIl-curve 
= be ae ldy 2mu | Bo= (9— 28.) (6—B.) Type II 
cv = 2 y2\—m = == “2 : 
(xvii) | Y=% (p?—w*) tp yau pw (u-shaped) 
(xix) y= ei ao ; dy _ =i B.=3, By=15, Bg=105 Normal curve 
2 Be, 8, and Bs 
(xx) y¥=Yo z | +,/p 4 ll <b a 0 simultaneously 
+u* ydu pu satisfy F,=0 
== and F,=0 
m m 
m=|m|, p= =l|¢,j, 4=—— = —— 
il, P=lpl g=lal q-P’ q+p 
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see - 5B? ! 
Hence there is only one point on the curve B= 2 for each Bz which corre- 
—Be 
sponds to Professor Pearson’s symmetrical curves. At other points we can have 


other types only. 


The normal curve is a special case of the 2nd order symmetrical curves and 
corresponds to the point Bz=3, Bg=15 and Bg= 105. 
m 
(2) Consider the equation y= ye". 
This equation is obtained by integrating the differential equation 
1 dy _ 2myu_ 2m, 
yu il tl 





Hence it is the type we find along the curves of section of the surfaces 
(i) Fyo=0 and F,=0, (ii) Fo=0 and F,=0, (iii) F,=0 and Fe= 


Note that since Po=pigm, Fo=qit+pi, Fe=(qi—p1)*, the surfaces Fy=0, 
F,=0 and F,=0 in general cut each other where p; = 0 =q, znd hence we have 
a common curve of intersection. 

For the equation above 

mo? 
Ca = const. x a*+4e a 
which is never zero and accordingly the equation 
“is mo* 


y=yoe * 
does not represent a complete frequency distribution. 


(3) We can write the Type (xiii) curve as 


(g+p)o* _ 1], ie 


Y=Yo| not + a2 &. yt Yo=yo(q+p)o” 


po® +a?" 


Hence the Type (xiii) curve is the part above the line y= yg of Pearson’s Type VII 
aver 
ape. Similarly the Types (xiv) and (xv) represent the parts 


above the line y= ye of the curve y= yee os of 


curve y= Yo 





- and the inverted u-shaped curve 


ye f= )o* 
die ass 2° 
° qo? xz 
(d) and (e). The Inequalities satisfied by Bz, Ba and Bg; Note on the Surfaces 
F,=0 and F,=0. 
Before we can discuss the intersections of the transiticn surfaces with each 


other we must know something about the relations between f,, 84 and Be, and 
also when the branches of Fg =0 and F,=0 become imaginary. 
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In this way we introduce the further conditions, viz. 
(i) Be >B28a> 8% and By>A,?*, 
and (ii) L>0, Q>0. 
It is easy to show that 
I. (i) for By >3, Fe’ =0 and Fy” =0 cut at Ay= 





2 
-— 2, and are always real ; 
— 2 


2 
(ii) for 8g< 3, Fe’ =0 and F,”’=0 cut at Ag= — and again at 


Ba 
Bz = Ba (19 — 58a) 
* 7(3-B:) * 


the branches F,’= 0 and F,” = 0 become imaginary. 


2(19—5 
For values of B4 > Be? (19 — 58s) 








7 (3 — Be) 
2 
II. (i) for Bo >44, Fy’ =0 and F,” =0 cut at By= man and are always real ; 
— Pe 
os , ” 5B" ’ ” 
(i) for Bp< 48, Fy’ =0 and Fy” =0 cut at Ba=5 B , Ba=a’, Ba =a”, the 
— Be 


curves becoming imaginary for values of 84 between d’ and d”, where 
d’ =1[(37B2 —30)+2(f,—-1)V15(15—7PBe)]), 
d” =1[(3782— 30) — 2(8,—1)V15 (15 —78s)]. 


(f) The Intersections of the Transition Surfaces with each other. 
Let ¢, denote the value of Bg on the surface F,=0. Then by finding the 
values of Aq in terms of 82, which make (¢, — ¢,) = 0, ( = 0, 2, 4, 6, 8), and by 
s 


substitution in F,=0 or F,=0 the value of 8g, also expressed in terms of B,, we 
can determine the curve of section of the surfaces F,=0 and F,=0. 


We have found 


do = i = (14847 — 583784)/9Be, 


f =9 Ps Ba) =~ \7Ra? — 5082 Ba + 2582"}/9 (3 — a), 
nm “5 (Ge, — 9) = 498s? — 2108284 + 12582*}/9 (582— 9), 
» b6-4|K| VL 
$= 98-B ” 
B~-4|K|VQ_ 


Pe = 97 (Ba— 1) (TB — 15)" 


* J. Shohat, Biometrika, Vol. xx1. For a discontinuous distribution the proof is straightforward 
and similar to that given by the Editorial Note on Shohat’s memoir. 
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where K = 5B." —(6 — Bs) Ba, 
b=(3— Bs) fe— 882K, 
L=B,? (19 — 582) —7 (3 — Bz) Ba, 
B=(582—9) fa — 40, 
Q = 498,? — 1484(3782 — 30) + 782? (6082 — 53). 


(a) The following table gives the values of 84 and fg, in terms of 8s, at the 
intersection of Fo, Fz: and F4=0 with each other: 


TABLE III. Values of 84 and Bg. 





Surfaces or — Ps Values of 8, Values of 8, 








(218, — 258,*) (58:°-6—B28,) | 58,?/6—B, | 258.5 (82+8)/9 (6- B,)* 


‘= F. =0 as 
rere fs 9B» (5B, — 9) 2528,2/21 1258,8/81 


| 
| 











F,=0 and Fy<0 | 2(781-158:) (5826-281) | 58/6-B, | 258,8 (8,+8)/9 (6 - B® 




















~ -9(3—Be) (58-9) 1582/7 2582/9 
| 
| 5 es 
F,=0 and F,=0| (78s— 58s!) (58:'-6—BeBs) | 5B? * | 2582" (82 +8) 
| 92 (3 — Be) | 6-B | 9 (6 —B2)* 
| | 


| 





(8) The discussion of the values of 84 and fg, in terms of 2, at the intersections 


» ,(t=9, 2,4 
of F,=0 and F,, =0 te 8 


have a common curve of section, viz. 82= Be, Ba= 


) is more complicated. We know that all these surfaces 


52" _ 258° (B2+8) 
é——, = o6—gyr ond 
this enables us to simplify the lengthy algebra since (¢, — ¢,) (" =0, 2,4, 6,8) must 

5A.2 
ne ie a : : , 
a common curve of section, which by considering the expression for (dg — 4) is found 
to be 
B2=PBs, Ba=}{9(582—6) + 2(9—5As)!}, Ao=4 {+ 8(9 — 5As)t 
+ (18082 — 258." — 216)}. 
Similarly the curve of section of Fg=0 and Fy=0 is 


2 3 
B2= Pz, a= 19Ps ’ a = ASPs > 


and finally that of Fg=0 and F,=0 
B2=Be, Ba=4{(7B2 +24) +4|B2—3| V5 Be + 4}, 
Bo = vi {(85P2" + 1682 — 128) + 16 (382 — 4) (52 + 44}. 


* Note 6,> 8," and therefore 78,>58,° for all values of 8, and f,. 


as a factor. Furthermore, the surfaces F',= 0, Fg = 0 and F, = 0 have 
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The numerators of the corresponding expressions (¢,—,) each contain a square 
root and in obtaining the results above we may have introduced inadmissible values 
by squaring; furthermore the values of 8, and 8, must satisfy the inequalities 


(i) Be>B2Ba>Ps, Ba>Br®, 
(ii) L>0 and Q>0. 
After some very lengthy algebra, I found for which values of 82 the above solutions 


are admissible and interpreted the + and — signs of the square roots above. This 
discussion is given in the original manuscript *. 


(g) On the Division of the Bz, Ba, Bs Space into Zones corresponding to the 
different Types. 

In the previous Section the value of the expressions (¢,— ¢,) was found and by 
discussing their sign we are able to find for which values of 8: and 84, ¢, > or < ¢s.- 
This enables us to determine the relative positions of F,=0 and F,= 0 for any value 
of Bz, 84 and §¢ and also to discuss theoretically the curves of section of the transition 
surfaces corresponding to any given value of 82. The introduction of the surfaces 
F,=0 and F;=0 makes the classification of the different systems of curves of section 
very complicated and I kave confined myself to the surfaces Fo, fF; and F,=0. Their 
curves of section can be divided into three main classes, according as 82<1°8, 
18 < 82 <3, 82 >3, as shown in Diagram I, Figs. I, II, and III. The types of curve 
in the various zones are found by discussing the signs of Fo, F; and F, for the 
corresponding values of 82 and 84. These figures were of great use in the construction 
of Diagrams XII to XX, serving not only asa check, but also enabling us to determine 
very rapidly the types of curve corresponding to the different zones. 


Note. The curve F,=0 in Diagram I, Fig. III, has now no meaning because 
points on it do not satisfy the condition cg=0. I have reproduced it for the sake 
of uniformity. 

(h) The Constants for the main Types. 

The constants for the main types of frequency curves are expressed in terms of 
Bs, 84 and Be, viz. 

9 
m,= —;(5B82—9 — $4), 
1= GR B2—9)(Be— $s 


9B>, 
up=— 7 (Bs— $0), 


9(3— 
atpa= ‘ 5 a (Bs— $2), 


my eds J 4 my — (91 — pi)” 
a-~A (qi — pi)” 


- 3 (Bz — 1) (782 — 15) (8s — $s’) (Be — $s"") 
=t 3 . 4(8,—3)* (Se— 8 )(8s— $06") 


* Copy in the Library, University of London, 
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FIGURE I. 


B,<1-8 











FIGURE I. 
ing ® lo/ 



































‘ 
4 
xii F,-O 
4 w) F, =O} SAME AS ABOVE. a 
F,-0 
= 3 or 
B 158, 258; SCALE OF B, 5B: 
7 21 6-B, 





FIGURE II. 


BS 


xi 


(iv) viii . @ e / 
Ss 


By 258; SCALE OF B, 
Diagram I. Figs. I, II and III. 


























i : . 1 we: 4 . 

Ike last formula which gives k= “and also k’ = ——— is too complicated to be 
ire qt+p 

of any use in practice. It is far simpler to find m, and (q;— 1) from the formulae 


(6) and (7) and then = by division. 


ji— pr 
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Let us now find the constants of integration of each curve. Let the total 
frequency be NV, f(x) the frequency curve which is symmetrical, — 6 and + b the limits 
of frequency, then 


+b +d 
ve| f(a)de=2| ST (x) da. 

= 0 
By expanding f(z), we easily find the following expressions for yo: 
TABLE IV. Values of yo. 








Type Integral for N Expression for y) 








tn 
NG.o go : as © Hf (By i! | ng 
r . any | Bae =r)tr! i, 


aw, 
> 
Il 
abo 
o 
aan, 
aR 








mi fi 4 —y a /3 , 
(ii) N= 2yo | y —- da Yo= N = _TG +k ee 
i bee OLD J\ r(1+k) F (Kk, 4, & +3, -—p/9) 

















| | 
sos, | oNnp | N r (3- k) 
111) N=2y _—_—_— lx Y¥,=— ~——+- -—-* - 
; | vel x ) a [°° op ve r(l—-k) F(-&, 4,3-4, p/q) 
| 
| 





¥ t 
Tq (2 
wa (2) 1 


r k’ 
+5 
ae 2 dx Y= 7 
es a(p+q) 1 
qo 


(iv) 


s ong 
N= 2yo[ 
0 


r(3 —i’) 


le | y=: = J 
¢ : o/p PY ps ra. I’) F( i. Kk, i, 3-k,—p q) 























ery hk 
rol 1- y ./3 
: f oNp 2 N r(3+hk) 
(V1) iy = 240 | anediatge dx fo". ss 
Jo ; x ONPN a (144) F (4, L,3t+h,p q) 
qe | 
oo 
* Reduction formula is 
r-ta-ye-r-t rey =B,(h. k’-3 ai 
L.=- 5-1 4 esc y te T,=8,(4, & “2 ee 
: te tte 
4 I =f, 3a I, ‘O22 ge I, T onsy 


where : ? 
v-Fa-ykti-r ore 


Ty = Byasp (L- k’, k’ +4), an incomplete B-function, J,’= ee Te ee 


Biometrika xxv1 10 
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(For continuation, see Table VI, p. 150.) 


The series and the hypergeometrical functions converge rapidly when p/q is 
small. In other cases it is quicker to use quadrature formula. 


The Transition Curves. 


There now exists a relation between (2, 84 and 8.4; by substituting for Mg in 
terms of 8, and £4 we are able to express the constants of the curve in terms of 
Bz and A, and thus explain the ambiguity of sign which we find when working from 
the difference equations. This was necessary 


(1) to discuss whether a given point on any of the transition surfaces corresponds 
to a fourth order Pearson curve consistent with our hypothesis ; 


(2) to determine the type of curve at all points on the different branches of 
Fs = 0. 

The discussion is long and summarised to some extent in Table V on p. 147. 
I have excluded from the table the Type (viii) or rectangular distribution, the second 
order Pearson Curves and the Normal Curve which for convenience I denoted by 
Types (xvi), (xvii), (xviii) and (xix). 

It is interesting to find the values of the constants, expressed in terms of 8: and 
8;, directly from the corresponding difference equation which can be found from first 
principles or more simply from the general equation (1) on p. 130 by making the 
appropriate change. 

Let us now find the type of curve corresponding to the point J whose ordinates 

5B? 2582*(82+8 : ‘ 
are Ba= ae Be=- ie (6 Ba) ). Let us approach the point J along either 
«=0 or Fe” =0, 


_ -86..K+4|K|.VL 























Le. i ae 
_— 882-K+4VK*.L 
4K (82 —3) 
_ ~2B2+VL (Bs) 
4(B2—3) ° 
co aN ‘ _ 5B: ee 
and hence putting &4= 6— p? and simplifying, 
—28. + Bs een” 
P= : 
B2—-3 
‘mi _ 5B* . : es (11) 
Similarly the value of m when B4= é—s," 
™~K2 
_ (582-9) B2VB2—1,, 7 
M= (6— Bs) (a —3)* [3 VB,—1—*(5B2—9)(6 — Ba)], 





on simplifying. / 
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TABLE V. Values of Constants in terms of Bz and fy. 
Type of Values of the Constants in terms of 
Curve B, and 8, —— | 
m 
: The form of the equation is Y=Yoe pe 
peer (FV E+ 28.) 2, 8, and B, satisfy Fy=0, Le. | 
3— Be a= _b+4|K\IVZE | 
(vii) eee 1 {EVE (58-9) ° 9(3—B,)? * 
2 (3 — Be)? We must take the + sign if (i) F,’=0 and | 
— [7B (3 — Be) — Be (5B2+9)]} K<0 or (ii) F,’=0 and X>0, the —sign | 
U7Bs if (i) Fy=0 and K>0 or (ii) Fy’= 
and K<0. 
The form of the equation is 
m 
Ps (e =)" 
1 Y=% pt+ uz * 
P’=3—p, (784 — 58s) Bo, By and f, satisfy F,=0, ice. 
| (ix) 4 1 
] = (7 2 1 25 3° . 
| m= 93 —B,) (7B, — 1582) | 8 9 (Bp—3) (18 5082 83+258,°) 
| Fi i | Since p* and m must be positive we must 
also introduce the conditions 8. <3 and 
ks 
oni — oe 
| The form of the equation is 
Ld 
| : 1 te j 4-\2p 
| eS k (784 — 58s) y= (22 ;) 
|) i | hs 
| 1 | 8, By and 8, satisfy F,=0. 
= (3— Be) *" (784 — 1582) We also have the condition 8,;<3 and 
, a 158, 
| By 4 es sg 
m 
The form of the curve isy=mo( =~)" 
we I= 9g, (1B — 582") By, By and 8, satisfy Fy=0, ice. 
(xi 
1 1 ¢ 
m= 43. 218, — 258.7) 30= 55, (148,? — 5857 ,). 
| We must also have a> but <i *. 
be igi | oe em § cacti a ee 
78, — 58; : w ; 
. aa The form of the curve is y=% (. =a) . 
(xi!) Sek ace ) B., 8, and B, satisfy /)=0. 
pe 48. (2582°— 2184 | The Type (xii) satisfies ¢,=0 provided 
i Br<tth t Bo”. 
The form of the curve is y=y (2-5). 
i WS. ptu- 
1 By, 8; and 8B, satisfy Fs=0, ice. 
p= -_ — 938.) S 
can | PCG 1B) ENO (iBr— 2880) a,—- BtslKIVe | 
(xiii) 1 + 7 (Be— 1) (78 — 15)” 
1= 6B, pit* Q+7 (81 —B2)} We must take the + sign if (i) Fy’ =0 and 
\Pe~ 4 K>0 or (ii) F,’=0 and A<0, and the 
—sign if (i) #’=0 and A<0 or (ii) 
F.’ =0 and K>09. 
) 
. : <} for finite frequency, which easily reduces to p< er 
¢ 


10—2 
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TABLE V (continued). 





Type of Values of the Constants in terms of 
Curve B, and By Remarks 





The form of the curve is y=y,(2—", 
e form of the curve is y=» (Cre z 
Bo, By and Bg satisfy Fy=0. 


1 =~ : We must take the + sign if (i) /y’=0 and 
q= 2 (7B, —15) {+N Q-(7By — 2382)} K>0or (ii) Fy” =Oand K<0; othersiae 
the — sign. 


P= gaz (tYO+ (78-78) 


(xiv) 








-—u 


The form of the curve is y= = 3) ; 
Be, By, Bg satisfy F,=0. 


P=575— = V@Q- (781 — 238:)} 





(xv) 


1 Be se We must take the + sign if (i) F,’=0 and 
1~ 6 (B,—1) {+VQ+ (78s—78:)} K>0 or (ii) Fy’ =0 and A <0; otherwise 
the — sign. 





The form of the equation is y= vo (eae) 
Bz, Bs, Be satisfy 
8:=3[78,+24+44 (3 — 8) V58. +4] 
. = ays and B= 3; [(858,? + 1682 — 128) 
(xx) p=2+ (582+ 4) +16 (38) — 4) 58, +4]. 
We find 


p={2 (3-82) +|3—Bo| V58,+4}/(3 — 62), 
but the only admissible value is 
p=2+(58.+4)}. 

















At the point I we accordingly find the Type (viii).curve, viz. 
M 4 
y=yoe P-w, where u=—. 
We have managed to obtain limits to the expressions 
9Ps (582 — 9) — 498, + 21082 By — 1258.3 
4[5B2* — (6 — Bo). Ba] 
9(3 — Bz) Be + 7 Bs? — 5082 By + 258° 
2[5 Bs" — (6 — Ba) Ba] , 
148," — 982 Bo — 5Bs® Ba 
5Bo*—(6 — Bs). Bs” 
5 Bs" 25 Bs* (Bs + 8) 
= Bi Gag, ond >a 


Notea. (1) P and M are imaginary for Bz< 1°8 and for such values of Bz we do 
not get a fourth order Pearson curve at the point J. 














(2) If B,=1'8, M=0 and we get y=const. 
(3) Let us see for what values of B2, P and M are positive. 
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2— 1)(5f.—9 
Now oe Poa) = 2, 
according as (Bz —1)(5B2— 9) = 4(6 — Bs), 
» ” 5 (B,—1)(B2— 3) 2 0, 
» » Bz 2 3. 
Hence P is always positive. 
Again, 3 VB.—1 = V(5B2—9) (6 — Ba), 
according as 9 (Bz — 1) = (582 — 9) (6 — Be), 


» ” 5 (Be = 3)? z 0. 


Hence 3 VB,—1 > V(5B2—9)(6 — Bz) for all values of Bp. Accordingly M >0 for 
all values of Bz > 1°8. 





(4) When f, = 3, we must find P and M from the difference equation, which is 
easily found to be 


2s +1) P%us, — 2P (2s + 3) M+? 4 (2s + 5) Pt = 9 Me? 
( ) Pas ( oe ( oe ate 


Hence for s=0, P?—6P + 5B: = — 2M, 
g=%. 3P? — 10PB, + 7Bs= — 2MB,. 
Put = ih > in the equations above and solve for P and M; therefore P=5 
4 = 

and M=5. 

When f2 = 3, By = 15 and B, = 912 we get the curve 

=( 
y = yoe®-*. 

The ordinates of this curve and the corresponding normal curve are plotted in 


Diagram IT, showing how a change in f, changes the shape of the resulting frequency 
curve. 




















40n - 
— — ~ =TYPE (vii) CURVE, B,-3.B,-15,B,-918. 
NORMAL CURVE, B,-3, Bi-15, Bz 105. 
N _ 
&-100 
30H 
mn 
ey 
5 
44207 
i 
By) ; 
“” ‘ 
/ 
/ 
10H 
/ 
/ 
tA 
‘ 
/ 
wt > I a | 
pa" 18 +2 : 3 12 +8 34 30 





-6 re) 6 
SCALE of U- 2% 
Diagram II. 
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The constants of integration for the transition types found by considering the 
areas under the curves are as follows: 


TABLE VI. Values of yo (continuation of Table IV). 


























Type Value for V Expression for y, 
Né* 
(vii) NE=Sn,|x| e m api. © I m 
mo E; (- a +e * ~T9 = 480 po? } 
> Teta, 1 /1 —2*\* - 2s 5 
(ix) | V=2y,Vp.o [: ( +5) da 3cJp In 
1/1+22\" ds Bonet | 
x Tot J Fi ie i ] = 
Be i 2y0vp | (7*) ” 2p Ik 
m Nr ; 1 
(xi) N=2y Re (fe a =)" dx 20 V9 r(3- 2) r(1+2) 
J0 3! where 1=m/q 
























































m 
PE = aN a \q@ Lp, Te 
on |. FoM | 0 (3) - 2o Jq TE+D TI) 
= aNq qo? _ N Jp 
\ a * ’ dee —_____—~f 
(xiii) | ¥ ayo’ (#555) ; 20 [(¢+p) tan-! Vg p- Vpq] 
A 2 2. V ./¢ 
a * saitiat! otek ae Tee eee oa: ars : - 
y =?  sadited Wor —_—— — 
aiap 2vf) (are) Gi 20 [(p+q) tan~' Vp]q—Vpq] 
: dp [oot =~ 4% NAJq 
(xv N=2 of (! ; :) 1x Pe —- nee hay” , Ay 
i Yq \qo?= a8) o (2Vpq—9—p flog. (/g-+/p) — log. (Va-VP)}] 
7 9 2 N 
Cg U — = os 
(xx) W=2y0 ; (= = | de er a 
0 = \po? +a"; tap) 5 - } 
a 
: B,(-a)=[" Cds and is tabled in Mathematical Tables of the British Association for the 


x 
Advancement of Science, Vol, 1. The expression is easily found by replacing the summation by one of 
Integration. Unfortunately I have not had time to try it out sufficiently, but in the trials I have made 
it gave satisfactory results. = denotes the mean absolute derivation. 


+ The reduction formula is I[,,,=I,_,- 2, I,=1, a 


2 
Ie (V(4) TR) : Jr jr (t), TAI 
new? fre “ (Dl = -r1196, I tf #) 42 4H) _ 1.91010 
+ €T@® Tw “4 w@” Te@ 
and from these values I constructed Table VII (p. 151). I also proved that 
@(1-6) 
2! 
where AI,,=1,4, — I, ete., 


T49=1,+ OA, A, +...5 


and hence we can find J,,4 from I,, I,,, ete. by ordinary interpolation formulae. 
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Note 8. The Types (xi) and (xii) are rather interesting. Consider Type (xi) which 


for convenience I shall write in the form 
b? — 2?! 
We have 


antl ee) 


= yob**1 B(s —14+ 4, 1+1) = yb! a 








TABLE VII. Table of I, and its differences. 


























a I A? LS At : a Aé it 
+ + + 
I, -429 2037 97229 | 13063 3070 
i. ‘387 8376 64951 7028 1389 
I, 356 1945 45737 4063 673 
i 331 0465 33551 2487 358 
i 310 4722 25428 1594 207 
Be -293 2529 19792 1059 116 
I, ‘278 5765 15750 730 70 
* 265 8792 12767 518 42 
Is 254 7569 10513 375 36 
e 244 9113 8778 274 19 
1; -236 1170 7418 209 we 
& 228 2005 6833 163 sad 
Ik 221 0259 5456 | 125 ee 
| ae 214 4845 4741 | 95 = 
Ty 208 4887 4151 78 v2 
he -202 9670 3657 65 » 
rt ‘197 8605 3240 53 a 
5. ‘197 86048 | 116081 5687 593 
Ty 188 70268 | 93518 3970 365 
a -180 70569 | 76641 2847 233 
a ‘173 64387 | 63735 | 2088 153 
Ty ‘167 34846 | 53675 1563 103 
Fa 16169041 | 45704 | 1190 72 
lie 15656911 | 39295 | 920 50 
ya 15190484 | 34077 | 722 32 
a 147 63353 | 29778 | 574 34 
Ty | 14370298 | 26202 | 458 10 
De "140 07021 23200 | 376 21 





I_.;=°4310 632, J,=1°0, J,.;=°711 9587, 
T,.9 = "570 7963, 1.5; ="486 1816. 


r@-OTd+) 

I (3) 7 
s=1, ot=B(Q-Df, 
Corie 
s=3,  Bys=+}B2(10B2—3), 
s=4, Be = gs Ba (1082 — 3) (1582 — 6), 


Put s=0, N = by 
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-9) _ 5-9 

esi eehe » baa 5B 3)" 

Hence in every plane of = plane f,=constant we get at one point 
of the curve of section of Fo = 0, viz. B, and B, have the values above a special 
Type (xi) curve with the constants expressible in terms of £2 only. The difference 
equation is now 





(2s + 1) a® yo, — (28 + 8) omg =2mps,, (b®=a2o°). 
2 


Put s=0, a? — 3 = 2m, 
s=l1, 3a? — 5p, = 2m, 
s=2, 5a? Bo —< 7B,= 2mBe. 


The first two of these equations give us the values found above, while the last two 
equations give us the general values. 


Il. THE TRANSITION SURFACES. 


We have seen in Section I (g) (p. 143) how a knowledge of the curves of section 
of the transition surfaces for different values of Bg will enable us to determine what 
type of frequency curve corresponds to any given values of 82, 84 and fg. 


In this Section we suppose 82 known and then deduce general formulae in terms 
of Be, to simplify the calculation of the Ay- and A,-ordinates of the transition 
curves. 


(a) The auxiliary co-ordinates r and ®. 


There is no way of finding the approximate range of values which 83, 84 and 
8. can assume. The value Professor Pearson finds for 84 for his symmetrical 
curves is 
_ 58? 
~ 6—Bs° 


The lowest possible value for fy is 82%, and I suggest taking as the range for By 
the expression 


Ba 


9 ( 5BP A:*) ee 282" (Bs — 1). 


— 


6—Ps . 6— Ps 
in other words, I assume that 
282" (B2— 1) 
2< < 2 pone 
B? = By= BP + ta, -? 


The only justification that I have for this assumption is that in all the symmetrical 
curves I have fitted I found 4 well within the limits 


Bs? (B2 + 4) 
BZ and (6—Ba) ” 

















=== 
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To make the model, I shall give 82 different values and find the corresponding 
curves of section in the different 8:-planes. The curves of section depend then on 
84 and 8, and it is an advantage to calculate the same number of S¢-ordinates for 
each curve, and at equal intervals of 84, for each 82. But the suggested range of 
possible values for 4 varies as Bz varies, e.g. the range is from 4 to 6 when B:=2 
and from 25 to 225 when ®.=5, and hence the interval between the successive 
values of 84 must vary as 82 varies. Similarly we must also have a proportionate 
unit for Be. 


All the surfaces pass through the point J, whose co-ordinates are 


Be = Be, 
5B? 
Ba= 6— Be’ 
B ae 258.° (Bo 7 8) 
$9 (6 — Ba)* 


I propose calculating the 8e-ordinates at intervals of 
_18:?(B2—1)_1/ 5P? _ ‘) 
er tee a ae | (Ya 
and taking the unit for B¢ as 


258° (82 + 8) _ B38 (124 — 982) (B2— 1) 
9(6 — Bs)? ' 9 (6 — Bs)? ; 
In this way I introduce two auxiliary co-ordinates r and ®, which I shall measure 


from the point J as origin. The relations between 84 and r, 8. and ®, r and ®, 
then become 





_ 562? rh 82? (8. — 1) 

~~ 6—-B,e 7 6—fBe ’ 

252° (B2 + 8) /Bs* (124 — 982) (82 — 1) 
96-A) /  %96-AyY — 

5Be | 7 B?(B2—1)\ 258: (82+ 8) 

a si (5 — fp? "4 ae ~ (6—PRa? 

i Bq* (124 — 982) (Bs — 1) 

9(6 — B2)* 


where ¢, is the Bg-ordinate on the surface F, = 0. 


Ba 





P = Be — 











Let us represent graphically these relations. 
Let OQ, OR represent the axis of 84 and Be, 
a! a = » rand ®, 
O be the point (83°, 82°), 


: ( 5B" 258° (Be + ») 


: 6- Ba’ 96-B 


»”» 
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R R' 
372 
oo 
4 15 S 
© 
o v7) 
": 11a 
& 
g Ble, 40=5 
alalq” 
>1% Q L l i l rn 1 I(r=0,6=0) i n l L Q' 
jor? -6 -5 -4 -3 -2 -1 414. «+2 +=«+3 +=«44 +45 +46 47 
QIK Scale of r Scale of + 
nN 
+ o= -4 
May O=-1 
26 x Q 
B,? 5B. B,?(B2*+4) 
6B. 6-B2 
Scale of Bz 


Diagram ITI. 


_ 82? (B2 — 1) 
ia Ta 
PI = Bs? (124 — 982) (B82 — 1)/9 (6 — B2)*. 


I calculate the f¢-ordinates at intervals of +OP, transfer to the point J as origin 
and divide by PJ, to give me the corresponding value of ®. 


Then 


The following table gives values of PI and 40P for different values of Bg. 


TABLE VIII. 



































5 Bo }0P PI Bs }OP PI 
1:25 01175 27117 || 35 175 176:2639 
15 03571 102315 || 4 3°4286 469°3333 
1°75 ‘07721 2°67660 45 6°75 13157125 
2 ‘14286 5°88889 5 14-2857 4388-889 
2°5 38265 21°57738 55 388922 | 24789°88 
3 ‘85714 64:66667 6 © Pa 




















G. H. HAnsMANN 155 





2 
Note that since 84= 5 Ba is negative for 8B, >6 we cannot use the r and ® co- 
6é-A 


ordinates for 82> 6. I shall not make any diagrams for such values of 83. 
(b) Section I f(a) (p. 142) gives the relations 
a 148e'— 5Ra'Be _ fo 











ae a 
gy = REO tt), 
a alee er 
$0 (823), 
t= FBR TB) oe 
where b =(3— Bo) fo— 882. K, 


B= (582 —9) fa — 40, 

K = 58.2 —6— Bs. Ba, 

L=B,? (19 — 582) — 7 (3 — B2) Ba, 

Q = 498.2 — 1484 (3782 — 30) + 78,7 (608, — 53). 
I assume for the present By + 1°8, 82+ 24, B2+3. 


i 5 Bs" 
Write i=7 a? 
m= r Bs* (Bz — 1) -1) 
7 (6- Be) 


see 258° (82 + 8) 
~ 9(6—Br* © 
* We want the values of ¢, (J + m), or f,(1+m). Using Taylor's theorem, we have 


2 
: mi 
Fe (L+ m) = fe (l) + mf,’ (DY) + tLe (l)+... 
for « =0, 2, 4, 6, 8. The expressions fo, f2, f1, Z, and Q are quadratic expressions 
in 84 and hence all their third and higher order differentials with respect to Bq 
vanish. Again, we simplify the algebra by using the result 


25838 (82 + 8 
$, (l) = se Ban 


for « = 0, 2, 4, 6, 8 since all the surfaces pass through the point J. 
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Now fo’ (Ba) = 2884 — 582%, 
0 (Ba) = 28, 
Ii = Pa = — (Be + 22), 
Fo(l) = =9Bs bo (l) = 9Be.n. 


m2 } 


Hence fo (1+ m) =fo(l) + mfr’ (I) + ae (2) 
=9Be.n+— r B® (Bz — 1) 582° (82+ 22) 17? r Bs* (B2— 1) 














7 6-Be 6—Be 249 (6-2) 
<a do (lL + m)—n 
i ‘Po BF (124 — 98s) Ga 1) 
9 (6 — Ba)? 
r {2r (Bs — 1) +5 (Bs + 22)} 
7 (124 — 9Bs) 
Similarly, after lengthy algebra, 
®, = r {rB2(B2 — 1) + 60 (282—5)} L...(12) 


haptic a 


‘~~ (5B2—9)(124—98s) 


Oe" = r  {rBs (Be re 1) (Be 24 3) v defy} 





7 (B2—8)9(124-9R2) ’ 
where 
v = 328." — 18982 + 297 F {(6 — Bs)? (B2—1) [(582— 9) + 7 (Ba —3)}}3, 
@,!)"" = " {rBe (3182 en 51) i 4x} ‘ 
P 21 (124 —983)(7B2—15)’ 





where 


x = 9 (8382 — 183) F (6 — Re) [84 (582 —9) + 72r (282 —5) + Bo2r?}h. | 





Note Dg’ > Pe”, Ds’ > @,” according as Bz = 4. By giving A, its constant 
value and putting r=—7 to r=+7 we can easily calculate the ordinates. 


Not only have I solved the difficulty of choosing a suitable scale for 84 and By 
and so made the construction of a model practicable, but I have simplified the 
calculation of the ordinates of the transition surfaces considerably. 
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(c) Calculation of the Ordinates. 


(i) Substituting for 8, in Equations (12) we find the following relations between 


Dy, P,, Py, and r for different values of Be: 


TABLE IX. Values of ®o,... By for values of Bs. 


































































































8, Dp P, D, 
l r 187 4r 
7 161 23 
198 r [465 +27] r [2400 —5r] r [880 — 5r] 
is 3157 22099 4961 
ne 7 [235+ 2r] r[160-r] r[40-r] 
. 1547 1547 221 
a r [475467] 7 [1440-217] r [80-217] 
gs ~ 3031 15155 433 
a r[60+r] r[30-r] r[10+7] 
* 371 371 53 
ne [245 + 6r] — 15r2 r [280+ 157] 
wise ~ 4427 ~ 1421 1421 
; r [125+ 47] r{20+r] 
: —- ‘a 97 
Pe r (51427) r(96+7r] r[136+7r] 
BS 259 "359 629 
r[45+3r] r [65439] r(55+3r] 
4 154 308 249 
Pe r[265+14r]} r [320+ 217] r[120+7r] 
#s — 1169 1169 501 
‘ 7 [135+48r] 7 [150+ 107] r[80+5r] 
553 553 316 
i r [275 +187] r [1440+ 997] r [1480 +99r] 
ae 1043 5215 5513 
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In the same way we find the following table, which gives the relations between 
®,, P; and r for different values of Be: 


TABLE X. Values of Dg and Pg for Values of Be. 


























































































































Be Dz Ds 
‘ 4r r {3600 + 20r + 20 [r? — 216r — 336]2) 
23 19320 
195 | 7 [28352-35152 (—19.7r+11)*] | [456484 2457+ 76 (25r2—2880r — 3696}*] 
* 154693 236775 
a r[304—r $12 (—6.741)3] r[312+r+4 (r.7— 64 -56)?] 
: 1547 | 1547 
| 
1 
rez | 7 [16448 - 105r5 136 {51 (—5r-1)}8] | [21744-91768 {4972-17987 -336}4] 
. 75775 100023 
: r [94-1 F(256.T—7)] * » [306 —11r+8 (84—r.72—4r)!] 
S 371 1113 
- r[784—15r% (42.77) ] [3528 + 2657 F 28 (1176425722) 
By 1421 21315 
sieunaiiieing 
3 r [600 +7 (36+7)] 7 (1324+ 77 F2 (56+7.8+7)?] 
; 2716 2716 
ie r[17647rF 20 (2.17 4r)5] 7 [3096 4-161rF 4 (2856 +r. 5764407)" 
gts ~ 259 | 14763 = 
, r[5843rF (24. 11 +7)*] [13414737 $2 (92447. 9164 16r)4] 
ini 154 sj 6006 ve. t: 
se r (386+ 21r%4 (14.947)9)] r[30484177r £4 (504+, 1284 97)9] 
is 1169 12859 
: r [1524 10r F (64487) ] r [2088 + 130r F (1344 +7.3604- 207)! ] 
553 8295 
a r (7216+ 495r F 12 (74 + 10r)9] r [39384 + 2629r F 4 (6216 +r. 1728+ 1217) ] 
26075 147063 
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(d) From the previous tables it is now an easy matter to calculate the ®-ordinates 
of each of the transition surfaces for integral values of r from —7 to +7 and Bs from 
1°25 — (25) — 2°0 and 2:0—(°5)— 5°5. The results are given in Tables XI—XXII 
and the ordinates are plotted in Diagrams VI—XIX. 


TABLE XI. The Ordinates of the Transition Curves for Bz = 1°25. 


















































r | ®o _ aE: ®, $,” | $, $,” 
a — = 

—7 | —1°00000 ] — *77130 | —1-29107 | — 110922 | —1°47886 | —1°59766 | —1-00000 
—6 | — 86094 | —-65976 | —1°10058 | - -96474 | —1-25090 | -1-35127 | — -88771 
—5 | — *72062 | —-54867 | - 91211 | — “81714 | —1-02696 | —1-10891 | — -76726 
—4 | — +57903 | —-43803 | — °72566 | - -66610 | — -80737 | — -87104 | — -63817 
—3 | — °43617 | -*32784 | — 54122 | — -51124 | — °59250 | — -63829 | - -49983 
—2 | — *29205 | —-21811 | — °35880 | — -35263 | — 38230 | - -41131 | - °35158 | 
—1 | - “14666 | —+10883 | -_ te) = pS oe 
SOPs LIS | ‘ we ian 

| 

+1 | + °14793 | +:10838 | + +17638 | _ | — — — 
+2 | + -29712 | +°21630 | + °35074 — —- | -— | = 
| +3 | + °44758 | +°32377 | + °52308 —. | —-}; — 
| +4 | + °59930 | +°43079 | + °69341 | | — | — 4 — 
+5 | + 75230 | +°53735 | + °86172 | : - —- | — 
+6 | + "90656 | +°64347 | +1°02802 -- ON ee ie 
+7 | +1°06208 | +°74913 | +1°19230 | — — ae ae ee 

TABLE XII. The Ordinates of the Transition Curves for B2=1°5. 

r &, | ’, | &, | o, | ®,” | &, | ,” 

oe, See, Rene Meee eee oe : | 

—7 | —1:00000 | — *75566 | — 1°48869 | —1°08145 | | — 100000 
—6 | — -86490 | —-64383 | —1-24887 | - -94741| —.. | — *89083 
—5 | — *72721 | —-53329 ne — 1-01810 | — 80870 | —1-18871 | —1°21202 | — -77246 
—4| — -58694 | —-42405 | — -79638 | — -66474 | — 92802 | — -94839 | — +64438 
—3 | — -44086 | —-31610 | — 58371 | — -51473 | — 67596 | — -69263 | — 50582 
~2| — -29864 | —-20944 | — -38009 | — -35760 | — -43361 | — -44586 | — -35569 
—1 | — *15061 | —+10407 ‘ty "18552 | — +19716 | — -19716 | — *20879 | — -19328 
+1 | + °15320 | +°10278 | + °17647 — — — —- 
+2 | + *30899 | +°20427 | + °34389 — - - — 
+3 | + *46736 | +°30446 | + +50226 - — — -— 
+4 | + °62831 | +°40236 | + °65158 ~~ — —_ — 
+5 | + *79186 | +°50097 | + *79186 _ -- — — 
+6 | + 95798 | +°59729 | + -92308 — -- — - 
+7 | +1°12670 | +°69231 | +1°04525 _- _ — — 
































— Denotes that the value of @ is imaginary. 
. Denotes that ® is now a large negative quantity. 








160 On certain Non-Normal Symmetrical Frequency Distributions 


TABLE XIII. The Ordinates of the Transition Curves for Bz = 1°75. 














by &, &, &,' ®," ,’ ,” 
—1-00000 | —-73303 ~ 1°06418 BF i — 1-00000 
— -86902 | -°61999 ne ~ 93813 | —1°76641 | —1°78012 | — -89407 
— ‘73408 | —-50973 | —2°13626 | — -80600 | — 1-43392 | —1-44152 | — -77788 
— 59518 | —-40224 | —1°51501 | — -66695 | —1-11390 | —1-11741 | — -65082 
— *45233 | —-29753 | — -99076 | — -51979 | — -80754 | — -80869 | — °51203 
— *30551 | —-19558 | — -56351 | — -36277 | — -51657 | — -51672 | — -36012 
— *15473 | —-09640 | — -23326 | — 19281 | — -24408 | — -24405 | — -19255 
+ 15869 | +-09363 | + -13626 a = en je 

+ 32135 | +°18449 | + °17552 = ane es wer 

4+ °48796 | +°27258 | + 11778 Re aS ae se 

+ °65853 | +°35790 | — -03695 es = a sa 

+ °83306 | +°44045 | — -28868 ee ie ms <_ 
4+1°01155 | +°52022 | — 63741 = 3 ae - 
4+1°19400 | +.59723 | —1-08314 a a = 


























TABLE XIV. The Ordinates 


of the Transition Curves for Bz = 2. 






































® ®, ®, ®, $," ®, $,” 
—1°00000 | ~-69811, | — -39623 | — 1-07066 — 100000 
— 87332 | —-58221 | — -45283 | — -94881 — *89745 
-- *74124 | —°47170 | — 47170 | — -81952 ee a — °78351 
— *60377 | —*36658 | — -45283 | — -67087 | —1-44234 | —1-85820 | - -65752 
— 46092 | —-26685 | — -39623 | — -52561 | —1°04313 | —1-30901 | — 51848 
— °31267 | --17251 | — -30189 | — -36812 | — -66692 | — -81395 | - -36484 
— °15903 | —-08356 | -— -16981 | — -19507 | - -31706 | — °37574 | — -19390 
+ 16442 | +°07817 | + -20755 | + -25067 | + -25067 | + -23630 | + -29380 
+ °33423 | +°15094 | + -45283 = a 
+ *50943 | +°21833 | + -73585 ae) = 
+ *69003 | +-28032 | +1-05660 a 
+ *87601 | +°33693 | +1-41509 _— — 4 - . 
+1:06739 | +°38814 | +1-81132 —_ | - 
+1°26415 | +°43396 | +2-24528 Ae we - 








TABLE XV. 


The Ordinates 








®, 


— 1°00000 
"88248 
*75651 
— +62210 
*47924 
°32794 
"16819 


“17664 
"36172 
°55524 
*75721 
‘96763 
+1°18649 
+1°41379 








*26390 
*16890 
“09500 
— °04222 
“01056 


*OL056 
— *04222 
“09500 
*16890 
*26390 
“38001 
*51724 


2, 


*86207 
*80225 
°72132 


*18649 


*20760 
*43631 
“68614 
+ °95707 
+1°24912 
+ 1°56228 
+1°89655 


“fe 
7 
oa 





“61928 | 
“49613 | 
"35186 | 





— 1°03466 | 
“92780 
*81070 
“68167 
*53871 
*37948 
*20109 


+ *85397 
+1°63231 
+ 2°32636 
+ 2°92273 
+3°39769 
+3°69653 
+3°34483 





” 
De 


~ 191187 
“92347 


*22837 
“49014 
*79397 
+1°15327 
+ 1°59175 
+2°16413 
+ 3°34483 


+++ 





ri) i‘ 





“09885 
“18641 
*23803 
*25455 
*23715 
*18746 
"10756 


+ °22347 
+ °47461 
+ °75595 
+ 1°06959 
+1°41714 
+1°79979 
+ 2°21839 


— 1:00000 
“90465 
“79551 
“67175 
*53217 
“37515 


*19861 


+ °13243 
+ °28692 
+ °46094 
+ °65239 
+ °85966 
+1°08156 
+1°31724 








— Denotes that the value of & is imaginary. 
... Denotes that ® is now a large negative quantity. 
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TABLE XVI. The Ordinates of the Transition Curves for Bz=3. 

















Tr ®, &,* , ®,” * #,” ,’ ,” 
-7 | —1°000 000 — °938 144 | —1°023 196 — *711 340 | -— 1°000 000 
—6 | — *892 489 — *865 979 | — :927 835 — °678 057 | — °912 517 
-5 | — +773 196 — *773 196 | — °819 219 — °*619 984 | — -808 588 
—4}]— °642 121 ea — "659 794 | — °695 140 a — *538 150 | — °687 182 
—3 | — *499 264 — °525 773 | — °553 387 — *433 846 | — °547 008 
—2|— °344 624 — °371 134 | — °391 753 — °308 493 | — °386 647 
—1 | — +178 203 — *195 876 | — °*208 027 — °163 476 | — °204 713 
+1 | + °*189 985 + °216 495 | + +234 536 + °228 460 | + °180 965 
+2) +4 °391 753 + °453 608 | + °497 791 + °481 401 | + °378 687 
+3 | + °605 302 + ‘711 340 | + °791 973 + °*759 358 | + °592 630 
+4 | + °830 633 we) + °989 691 | +1°119 293 me +1°062 716 | + °822 409 
+5 | +1°067 747 + 1°288 660 | 4+-1°481 959 +1°391 753 | +1°067 747 
+6 | +1°316 642 +1°608 247 | +1°882 180 +1°746 666 | +1°328 445 
+7 | +1°577 320 +1°948 454 | +2°322 165 +2°127 600 | +1°604 359 





























TABLE XVII. The Ordinates of the Transition Curves for 82=3'5. 























| 
r Po ?, | ®, | ®, | &,” $,’ *,” 
EMG SEIS ARE: FE porns 
- 7 | —1:000 000 | —1°270 270 | — *968 203 | — 1°015 062 | | — °867 236 | —1-000 000 
-—6 | — °903 475 | -1°250975 | — °896 661 | — °931 082 | = | — °811 839 | — °919 516 
-5 | - °791 506 | - 1°177 606 | — 802 862 | — °830 510 | | — °728977 | — °822 875 
—4 | — °664 093 | —1°050 193 | — °686 804 | — +710 728 | _ | — 624 750 | — °703 978 | 
—3 | — ‘521 236 | — *868 726 | — °548 490 ts “569 536 | _. | — °498 792 | — °563 187 | 
| —2 | — °362 934 | — °633 205 | — °387 917 | — :405 062 — — °352 184 | — *399 425 

~—1 | — +189 189 | — +343 629 | — +205 087 | — -215 687 | —1-089 332 | - -186 003 | — -211 613 | 

| | | | | | 

| 

| 


ae 
“397 683 *243 243 | + °236 605 Pe: *204 633 








| 

+1 | + °204 633 | + + °227 345 | +1°169 884 | + | 

2 | + °424710 | + *849 421 | + °476948 | +2°419 215 | + °515 148 + °498 185 : “427 914 
+3 | + ‘660 232 | +1°355 212 | + -748 808 | +3°747 001 | + °816 705 | + *784911 | + °669 671 
+4] + °911 197 | +1°915 058 | +1°042 925 | +5°152 352 | +1°148 807 | +1°095 818 | + °930 871 
+5 | +1°177 606 | +2°528 958 | +1°359 300 | -—--—--— | +1°512 259 | +1°434 238 | +1:°208 179 | 
+6 | +1°459 459 +3°196 911 +1°697 933 | —*-—*— | +1°907 801 | +1°796 979 | +1°504 789 
+7 | +1°756 757 | +3°918 919 | +2°058 824 | —---- +2°336 106 | +2°185 167 | +1°819 574 

| 








TABLE XVIII. The Ordinates of the Transition Curves for Bz = 4. 











| 
r | by by ®, ®, ," | &,’ | &,” 
| | | | i 
Cee” SSBC asi Fae eee Paes 7 “7 Se =e one 
71 -1:000000| ... ... .. | — -983 471 | -1-009 184] ... ... ... | _ -934 732 | — 1-000 000 
—6 | — °915 584 | —1°051 948 | — -917 355 = O96 G40} ..s5 Boke | _ -873 561 | — -930 635 
-5 | - +811 688 — 974026 | — °826 446 | — -844 156] ... ... ... - — 513 | - "838 529 | 
4 | — °688 312 | — °857 143 | - -710 744 | — °728 273] ... ... ... | 4957 | — °722 313 | 
-3 | — *545 454 | - -701 299 | - +570 248 | - -587 213 | —1°127 073 | - "539 966 | - 580 913 
2 | 383 117 | — °506 494 | — -404959 | — 419520 | - 801 259 *382 231 | — °413 640 
-1 | — *201 299 | — °272 727 | — °214 876 | — 224079 | -— +425 202 162 | — *220 082 


| 
—o 
| 


bo 
Y 
bo 
C te 








| 
| — | —= 
| 

+1 | + °220 779 | + -311 688 | + +239 669 | + °473 835 | + °253 438 | + °246 753 | + °224 109 
+2 | + °461 0389 | + *662 338 | + +504 132 | + +995 630 | + °536 838 | + °520 268 | + °470 075 | 
+3 | + °720 779 | +1°051 948 | + °-793 388 | +1°564 876 + ‘850 708 | + °*820 604 | + °*737 838 
+4 | +1000 000 | +1-480 519 | +1°107 438 | +2°181 134 | +1°195 489 | +1-146 468 | + 1-028 691 | 
+5 | +1°298 701 | +1°948 052 +1°446 281 +2°844 024 | +1°571 561 | +1°500 218 | +1°340 276 | 
+6 +1°616 883 | +2°454 545 | 4 1°809 917 | +3°553 208 | +1°979 259 | +1°882 872 +1°671 574 
+7 | +1°954 545 | +3°000 000 | +2°198 347 | +4°308 392 | +2°418 881 | +2°290 781 +2°026 236 

| 
* See (f) (iv) on p. 164. —+-—-+— Denotes that & is a large positive quantity. 
— Denotes that the value of Pisimaginary, = = —... ... ... Denotes that # is a large negative quantity. 
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TABLE XIX. The Ordinates of the Transition Curves for B2= 45 















































r Bo , ®, o,/ $," &, #,” 

-7 | —1°000 000 | —1°035 928 | — *992 016 | —1°004 994 | —1°258 479 | - -969 516 | — 1-000 000 
-6 | - °928 999 | — *995 723 | — °934132 | — -944 792 | -1°210 896 | — ‘912 209 | — 941 123 
—5 | — *834046 | — -919 589 | — *848 303 | — °859 995 | —1°116 053 | — °*826 372 | — °855 718 
—4 | — °715 141 | - °807 528 | — °734531 | — ‘747 763 | — -976 788 | — °713 384 | — °742 405 
-3 | — °572 284 | — °*659 538 | — °*592 814 | — *606517 | — °794 681 | — *573.985 | — *600 446 
—2|—- *405475 | — °475 620 | — °423 154 | - °435 247 “570 741 | — +408 522 | — °429 490 
—-1 | — *214713 | — *255 774 | ~ °225 549 | — -233 249 | — °305 673 | — °217 164 | — °229 371 
+1 | + *238 666 ‘3 *291 702 | + °253 493 | + °345 876 | + °264903 | + °258 673 | + *242 922 
+2 | + °501 283 *619 333 | + °534 930 | + °731 632 | + °561 782 | + °546 676 | + °511 571 
+3 | + °787 853 = “982 891 | + °844 311 | +1°157 004 | + *890 900 | + °864 029 | + -*805 930 
+4 | +1°098 375 | +1°382 378 | +1°181 637 | +1°621 772 | +1°252 479 | +1°209 498 | +1°127 231 
+5 | +1°432 849 | +1°817 793 | +1°546 906 | +2°125 749 | +1°646 707 | +1°586 826 | +1°471 732 
+6 | +1°791 275 | +2°289 136 | +1°940 120 | +2°668 771 | +2°073 744 | +1°992 285 | +1°843 161 
+7 | +2°173 653 | +2°796 407 | +2°361 277 | +3°250 698 | +2°533 733 | +2°427 124 | +2°240 268 

TABLE XX. The Ordinates of the Transition Curves for Bz = 5. 

r by &, &, ®,! B,” &,’ b,” 

7 | —1°000 000 ; —1°012 658 | — ‘996 835 | —1°:002 172 | -1°073 77 — 988 186 | —1°000 000 
—6 | — °943 942 is ‘976 492 | — *949 367 | — °954 792 | —1°041 591 | — °939 483 | — °952 742 
—5 | — °858 951 | — °904 159 | — °870 253 | — °877 948 | — :966537 | — °858 951 | — °874 623 
—4|— °745 027 | — °795 660 | — °759 494 | — -769 209 | — 851044 | — °747710 | - ‘764 526 
—3 *602 170 | — *650 995 | — °617 089 | — °627 534 | — °696 155 | — °606 107 | - 622 102 
-—2 | — *430 380 | — °*470 163 | — °443.038 | — °452 339 | — °502 453 | — 434 260 | — °447 235 
—1 | — ‘229 656 Se *253 165 | — °237 342 | — °243 249 | — 270313 | — °232 216 | — °239 875 
Se. | eee . ae pel 4 ee tas 
+1 | + °258 590 | + °289 331 | + ‘268 987 | + °308 292 | + ‘277 603 | + °272 403 | + °262 378 
+2) + °546112 | + °614 828 | + °569 620 | + °654 410 | + °589 713 | + *577 340 | + *554 908 
+3 | + *862 568 | + *976 492 | + ‘901 899 | +1°038 232 | + ‘936 451 | + °914 455 | + °877 950 
+4 | +1°207 957 | +1°374 322 | +1265 823 | +1459 660 | +1°317 917 | +1°284 835 | +1°230 414 
+5 | +1°582 278 | +1°808 318 | +1°661 392 | +1°918 608 | +1°734 195 | +1°687 397 | +1°613 386 
+6 | +1°985 533 | +2°278 481 | +2°088 608 | +2°415 005 | +2°185 356 | +2°122 505 | +2°026 501 
+7 | +2°417 722 | +2°784 810 | +2°547 468 | +2°948 791 +2°590 158 | +2°469 758 

| 














| 


+2°671 463 | 








TABLE XXI. The Ordinates of the Transition Curves for a 5: 














?, 
—7 | —1°000 000 
—6 “960 690 
-5 *886 865 
—4 | — °778 523 
—-3 | — *635 666 
| -2 |) -— *458 293 
| —1 | — *246 405 
+1 | + *280 920 
+2 | + °596 357 
+3 | + °946 309 
+4 | +1°330 777 
+5 | +1°749 760 
+6 | +2°203 260 
+7 | +2°691 275 














—1°002 685 
‘973 346 
*906 040 
*800 767 
"657 526 
‘476 318 
*257 143 


*295 110 
*628 188 
"999 233 
"408 245 
+1°855 225 
+ 2°340 173 
+ 2°863 087 


- 
+ 
an 
+1 


®, 


*999 274 
"964 266 
*893 343 
*786 505 | 
*643 751 
*465 OS3 
*250 499 





"286 414 
*608 743 
*966 987 
1°361 146 
+1°791 221 
+2°257 210 
+2°759 115 


++++4+ 





|— 
‘i 
. 





-l 000 5 oe 37 
“966 696 
"897 836 
*792 488 
*650 209 
‘470 782 
*254 O75 


*299 942 
*638 339 
“O15 154 
*430 355 
*883 917 
“375 818 
“906 040 





— 1013 423 
‘987 360 
“920 381 


*668 
*484 309 
*261 438 


*291 506 
+ *620 491 
+ 986 994 
+1°391 045 
+ 1°832 670 
+2°311 890 
+ 2°828 725 


ob 


*813 955 | 
525 | 





“997 335 
*960 930 
*888 767 
‘781 465 
“638 844 
*461 002 
*248 O80 


+ °288 123 
+ ‘612 596 
+ °973 420 
+1°370 595 
+1°804 121 
-'-2°273 998 
+2°780 227 





®,"’ 


1°000 000 
“965 592 
*895 434 
*788 910 
“646 197 
"467 199 
*251 774 


*283 238 
+ *601 632 
+ °955 182 
+ 1°343 889 
+1°767 750 
+ 2°226 767 
+ 2°720 940 


+h 
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(e) The limiting curve 8, = 8284; the ® co-ordinate of the second order Pearson 
curve. 


(i) I have calculated ordinates at values of r from —7 to +7 and found the 
corresponding ®-ordinates for different values of 82. All these values of ® will not be 
possible because 82, 84, and 8, must satisfy the inequalities 84 > Rx®, By > B2Ra>Be*. 
For all values of r from — 7 to + 7, 84 > 2? and By > 82°. Let us consider now the 
surface 8, = 8284 and express this relation as an equation in r and ®. 


@= {8 25,3:8 (Bs + a ~ 98») (Bs — dy 














~ 9(6—B2)* J 9(6— Bs)? 
g, | 5B" , 7 Bat (Ba — a _ 25B:* (Be + 8) 
_"*\6—B2 | 7 6-B: 9(6—Aae 
is As* (124 — 982) (Bs — 1) 
9(6 — 82)? 
since Ra= 5Bs" re 82° (B2— 1) 





~6-B2 7 6-2 

Hence, on simplifying we find 
_ 9r (6 — Bs) — 490 
= “7 (124 — 9B) 
Let us denote this limiting surface 8g = 8284 by F,=0 and this limiting value 
of ® by ®,. For given 2, ® lies along a straight line which passes through the 
point (r=—7, ®=—1). I shall accordingly calculate ®, for r=0 and 7, and for 

different values of B.’. 





TABLE XXIla. 





























By 1(0) ®,(7) B2 | ,(0) ,(7) 
1°25 — 621 — *242 3°5 —*757 — ‘514 
15 - “634 — 267 4 — *796 —*591 
1°75 — “647 — *293 45 — *838 — “677 
2 — *660 —*321 5 — *886 — ‘772 
2°5 — *690 — *379 55 — *940 — 879 
3 — °722 —"443 || | 

a 








(ii) The values of 84 and 8, corresponding to the second order symmetrical 
Pearson curves are 





inet ee i 358° 
a= = By’ Le. r= 0 and Bs= — 289) (6 — Bs)” 
Hence =! 358. _ 258° (Bs xn fe (124 — 982) (82 — 1)) 
\(9—282)(6—B2)  9(6— Ps) J, 9 (6 — 82)" Fe 
10 (582-9) | 


~ (124 — 982) (9 — 2R2) 
Note that ® is negative for 8g < 18 and for 82 > 45. 
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If Bg=4°5, ® becomes infinite. By substituting for B2 we can find the ® 
ordinate corresponding to that value of 82. We find 


TABLE XXIIb. 








Be ® Bs ® 
1°25 — °038 2°5 + 086 
1°5 — 023 3 + *206 
1°75 — °002 3°5 + °483 
2 +°019 4d +1°189 























(f) I will now discuss the curves of section corresponding to critical values of Bs, 
viz. B= 1, 18, 24, 3. 


(i) Bg=1. We now have the “two-lumps” distribution for which Bg=84=8.=1. 
Now the points 0 and J coincide, in other words, for all values of r, 84 always 
remains unity. I have added the curves of section in the case 82 = 1 to each of the 
contour diagrams for the sake of completeness. 


(ii) Bz =1°8. The curve of section of the surface F,=0 reduces to the two co- 
incident straight lines 84= 42. The discussion is similar to that for 82 = 3. 


(ili) Bg=2}. The equation for the surface F;=0 simplifies. Here also the 
discussion must unfortunately be omitted owing to lack of space. 


(iv) Bg =3. The section of the surface F,=0 by the plane §: = 3 reduces to the 
two parallel straight lines By=15 and 78,4 = 45, i.e. r=0 and r=—10, whereas 
the curves of section of the surface Fg =0 reduce to the straight line 8g= 15 and 
the parabola 38, — 275 = 4984? + 4284 + 99465, Le. 





_ B8.—275 r[600+36r+r7]_. 8 os Se 
®, = — 7 7 on a716 , since By = 15+ 7 
Now m,=9 ee s. 
9(Be- 0) 


mPa 15 = Ba) ’ 
qa + pr=—} (78a — 45), 


1296, — [498.3 — 6938.2 + 931584 — 30375] 
36 (15 — Ba) , 


Ba)? (Bs — ds’) (Be — $s’"). 


(qi = pay = 
81 


5 = 


m* — (qi — pi)? = 4a 


The values of the ordinates Bp, &4, Pg, Py are shown in Table XVI, p. 161, 
and the transition curves drawn in Diagrams XVI and XVII (right-hand figures). 
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We find that F, =0 lies below both Fy=0 and F,=0 for By < 15 and above for 
84> 15. Hence for points on Fy = 0, m < 0, pigi > 0 and gq; + p: < 0, and so we have 
the Type (vii) curve. Between the line 8y= 15 and the curve F, = 0, (qi —pu*< 0 
and we find a heterotypic area. 












TRANSITION CURVES for B,=1'8 
z 
mal 
oe 
Z 
=) 
ar) 

al IMPOSSIBLE AREA 




















o SCALE of B, a" 


Diagram IV. 


Again, the area which corresponds to the Type (iii) curve, viz. that between 
Fy=0 and F, = 0, always lies between the two branches of 7, =0 and hence the 
< 1 is always satisfied. Furthermore, points on F,’ = 0 correspond 


qi— pr 
to the Type (xiii) curve when 84 < 15 and to the Type (xv) when 84> 15; on the 


other hand, for F;” = 0 we first get the Type (xv) for 84 < 15 and then the Type (xiii) 
for 8B > 15. 


condition 








(1) Bs< 108. 


TABLE XXIlIla. 
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Signs of | 
Zone - —_— + Type 
m, NP; UtPy 
Above F',=0 + - - (i) 
On F,=0 0 _ - (viii 
Between Hes -. ed 
F,=Oand F,=0 2 (iv) 
On /y=0 - o | - (xii) | 
CiAISb>. Hae) 
Below #,=0 - + | - (vi) 
TABLE XXIITIb. 
(2) 108 < By < 15. 
Signs of 
Zone - | Type 
m NP; P| q +p 
Sarat Pte Sete Sea 
Above Fy=0 | + - ~ (i) 
SSSR S bevels Sack = 
ae | 
On F)=0 im + | 0 | (xi) 
Between | | | 
> BS | fass\ 
Fy=Oand F,=0 | ee ee. | | (iii 
|. = ——_—|_— 
On F,=0 ma + - | (viii) 
Below F',=0 Le.® + - | (vi) 
a i — ie 
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(3) By>15 TABLE XXIIlc. 
3) B,>15. 









































| 
Signs of 
Zone ———-— Type 
my nPi atPy 
Above F,=0 - 4 - (vi) 
EO Gas Sree es 
On F,=0 0 ie ee (ee (viii) Spage 
is MESSE ae 
Between ie = (iii) 
F,=Oand F,=0 Sadie / 
On F)=0 i Sz = (xi) 
eo [Ses ee Ke 
Below =O | + Cs _ (i) 
a $$ — $4 





















=| CONTOUR DIAGRAM fer BO, and por F negative 














4 3 Re oe 
SCALE of T ea: 
Diagram V. 
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(g) The contours of the transition surfaces. 


In plotting the ®-ordinate for different values of r, we find it convenient to 
make separate diagrams for r>0 and r< 0. 


(i) The con:our diagrams for the surface Fo = 0 are straightforward. Note that 
each curve of section must pass through the points (— 7, — 1) and (0, 0) and that 
®, decreases «3 Bz increases. See Diagrams V and VI. 


37 
a Va CONTOUR DIAGRAM for fi<Q. and for negative \ 
2 »D 

















8 TIVOS, 











Diagram VI. 


(ii) The contour diagrams for F;=0 and F,=0, shown in Diagrams VII, VIII 
and IX, present greater difficulty and we need more ordinates, Ax before we easily 
find the ®-ordinates for 
r=—7-(10)—+7 and Bg=1°'7 —(:15)—1°9 and 8.=2°6 (—1) —2°9, 3°1, and 3:2. 

Let us now consider the contours of the surfaces of Fz=0 and Fy=0. When 
r<0., increases : ak Phas : 
eee de Nciigianes ™ Sz increases till it becomes infinite at 8, = 3. It now changes 


increases 
’ decreases” 


sign and as fz, is further increased 


On the other hand, for 7< 0, da 
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e CONTOUR DIAGRAM for FO, and for t negative. 
1 
® 
1 
2 om 
3} 


¥ 














sane s 
Diagram VII. 





j 
i) 


‘ 


SONTOUR DIAGRAM fer FQ and fer T positive. 
































Diagram VIII. 





























al] CONTOUR DIAGRAM for Fe=O, and for V negative. 
=2H 
-3H 
=—-4H 
9 
= 
° 
“ 
a 
/, 
~7| i Yh 
4 
4 
4 
4 
% 4 
=-8H oO /p Bot “a 
<8 / 
ta? te 
¢ ¢ 4 
, ’ 7 
4 / 
-9} se ’ 
2 a a 4 
- of 7 
+i va 7 
o% 7 / aS 
tod o 7 
B) = vd “4 SCALE of r~5 


Diagram IX. 























s that the curves 
rough the pont I. 


{Denote 
poss thi 


The arrows denote the points of 
intersection of the curves for the 
value of By shown in circles. 
Note that only pieces of the 
upper branches are shown 


2s 

















-2 


“4 SCALE of r 
Diagram X. 
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decreases, becomes infinite at 82=1°8, where it changes sign and then decreases 
again; but if r > 0, oa decreases, changes sign at 82 = 1°8, decreases till 8, = 2°5 and 
then increases again. 


The contours of the surfaces F, =0 and F,;=0 are more complicated and I have 
omitted several of the lines to make their reproduction possible. Note that since 
the surfaces consist of two parts, each section gives two contour lines. If r< 0, the 
contour lines of F,’=0 are such that each member of the system cuts all the pre- 
ceding members, e.g. the contour line for 82= 2 cuts the contour lines for 8, = 1°75, 
8, =1°'5, and 8,= 1:25. Furthermore, ¢,”’ decreases as 82 increases, becomes infmite 
at B2=3, but does not change sign and then increases. Again, the lower of the 
contour lines for the surface F,=0 always passes through the points (0, 0) and 
(—7,—1), whereas corresponding ordinates on the upper branch decrease until 


82=2}, where they become infinite, change sign and then decrease again, as Be is 
further increased. 


The contour diagrams for F,= 0 and F,=0 simplify for r > 0 because there are 
now fewer lines. gg’ increases as fz increases, becomes infinite at 8Bg= 3 and now 
decreases as fz increases. Fy’’ = 0 represents a system of curves, such that each 
member cuts the preceding members. Both ¢,’ and ¢,’’ increase as fg increases, 


(h) The Transition Curves. 


Diagrams XIII to XXI show the sections of the transition surfaces by planes 
corresponding to different values of 82 and the frequency types corresponding to 
the different areas*. These areas are determined as in Section I (g), p. 143, but 
I have now introduced the conditions 


81 (82-3)? 











(qi — p~a)* = 4K? (8s — $e’) (Be — $e’) 2 0 
243 (Ba—1) (72-15 : Mi 
and my" — (qi — pi)? = (Bs ae ol (Bs — ds’) (Bs — $3") = 0. 


Accordingly the zones between the two branches of F, =0 and also those parts 
of the zones for the main Types (iii), (iv), and (v) and the transition Types (x) where 
m," —(q1 — pi)? is not negative, are now heterotypic areas, points of which do not 
satisfy the condition c, = 0. 


Furthermore, note that the Type (vii) can only occur when F, = 0 borders an 
area corresponding to the Type (vii) curve. Again, the Types (xiii), (xiv), (xv) must 
occur in the areas of the Type (i), (ii), and (vi) curves respectively. At all other 
points of F,=0 and F,=0 we get a different type of curve. 


Clearly m2 —(q: — pi)*< 0 if we are outside the two branches of F, when 83 < 15 
or if we are inside these two branches when A: > 4. 


* The heterotypic areas are the zones where we may get other types of curves, but not those 
considered above, and the impossible areas are those wherein the necessary inequalities 8, > 8,*, 
By > 8.8, > 8° do not hold. 
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Diagram XIX. 


Note (1) that the sign of K does not trouble us. 


(2) When F,’ = 0 and F,”’ = 0 become imaginary, (q: — 1)” is always positive; 
when F,’ = 0 and F,’’=0 become imaginary, m2 —(q, — p1)* is positive or negative 
ginary, 

mardi . 15 
according as Ay >or < 4A, 


In (f) of the first Section (see p. 141) I gave the values of 84 and B, at the 
intersections of the transition surfaces with each other, but all these values do not 
come into the diagrams of the transition surfaces because I have given an upper 
limit to 84. Table XXIV gives the values of » and ® at the intersections of the 
transition curves for different values of 8,. These values check the Diagrams XI 
to XIX. 


By painting the transition curves on glass and fixing the sheets corresponding 
to each value of 8 in their right order, one above the other, we can make a model 
in the Bz, 84 and 8.-space showing the zones corresponding to each of the fourth 
order Pearson curves, But unfortunately the curves are so close to each other that 
the model becomes too complicated and its reproduction in print would be impossible. 
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TABLE XXIV. 





























Co-ordinates of points 

Bo Surfaces which intersect nes asscatensandans 

r ® 

1°5 F, =0, Fi =0, and F,=0 — 1°302 — °2437 
j F,=0 and F,=0 5-0 +°7919 
Fy’ =0, Fi =0 and F,=0 —1°333 — °3326 
75 3. =0 and F,=0 — °409 — 0835 
. F,=0 and F,=0 + *556 +°0877 
F,=0 and F,=0 +1°905 +°1759 
2-0 #y)=0 and F,=0 — 1°667 —°2621 
F.=0 and F,=0 —5°0 — °4716 
+5 Fy’ =0 and F,=0 — 4°807 — *2439 
7 F,=0 and F,=0 —3°889 —°6066 
3°0 F, =0 and Ky =0 + 5°0 a 1-0677 
: F,=0 and F,=0 — 5:0 —°*7732 
4-0 F,{ =0 and Fy)>=0 — 1°667 —*3247 
F,=0 and F,=0 -6111 — *9259 




















III. SraristicaL APPLICATIONS. 
(a) General Remarks on Curve Fitting. 


A model would enable us to determine by inspection what type of curve must 
be used given the values of 82, 84 and 8s. This method is not practicable, because 
the model cannot readily be produced with sufficient accuracy for this purpose. 


In the illustrations dealt with in this Section, I have shown how to determine 
the curve to be fitted: 


(1) from first principles as in Example (d) on p. 188; 


(2) by interpolating into the Tables XI—XXI and using Diagram I, Figs. I, 
II and III, to determine the type of curve, then finding if the point falls 
into a heterotypic area or corresponds to a transition curve on the surfaces 


F,=0 or Fg=0 by considering the contour lines of these surfaces as in Example (e), 
p. 191; 


(3) by inspection of the two neighbouring transition curve diagrams, etc., as in 


Example (f ), p. 193. 
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I have discussed the transition curves at great length for 8g<6. For higher 
values of 82 we must calculate the type of curve to be used from first principles, 
following the method used in Example (d), p. 188. 


(b) Distribution of Product Moment Coefficients, pu. 

The distribution of the first product moment coefficient in samples drawn from 
an indefinitely large normal population is a Bessel-function* curve. When the 
sample is larger than 25 an excellent fit is obtained by using a Pearson curve with 
the same mean, standard deviation, and constants 8, and fz. For smaller samples 
a Pearson curve cannot be used to replace the Bessel-function curve. The distribu- 
tion of the first product moment coefficient is symmetrical when the variates of the 
population sampled are uncorrelated. The problem arises whether one of my curves 
with the same mean, the same standard deviation and the same constants Be, Bs 
and §¢ can describe the corresponding symmetrical Bessel-function curve. 

n 














The distribution of v= py ae is 
n-1 aie 
Na — p*) 2 
= C ; : e \" . K,-2} 
J/r.2? 7 (“ °} ) : 
25 
y LS, 2 _ 
N(~#*) 7T.(), m="22, 
n-2 j 1 2 
§ _ ae 
v.22 TS) 


where n is the size of the sample, o1, o2 and p refer to the parent population. 
Log 7, (v) is tabled* for m= 0 to m=11 and for different values of v. 


The sth moment coefficient* of py, about zero is 


,_ I, 
Ks = i,’ 
where I,= A. | [eeu + (— 1) e-*")] uw F'(u) du, 
0 
— n-2 





and F(u)=4 r( 5} 


) u 2 K,_»(2u). 
ae 


A, is found to be 


n 


~% 
N2"-(1 — p*) 2 
aT (n—2) 
On evaluating J, and dividing by I we find 
ne (a2) CT (n+s—1) [(s+1) 


n n+2s+1 n—1\" 
Sy ro 


A, = [2o, 02 (1 _ o*)}. 





* Biometrika, Vol. xxt. p. 164, sq. 
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The reduction formula for f, (p) is 
1 /n—-1 : 
fs (p) = 5 = + s) [fa (p) dp + constant. 
The constant of integration is found from the two reductions 
Ff; (0)=0 for odd s, 
n+ n—1 
a r( 9 +s) T( 3 +s) 
ssi FQ) = CT(n+s—1)T(s4+1) ~ 
(r=0, 1,...8)*. 











J-1(p) = 9, and hence we can find f, (p) 
We find 





n+11\ ,/n—-1 
ay eke TS) p® , 2p? 3p 
Je P) 168 [(n—-1) : 


5 on “n(n+2) 
.+13 n—-1 
is J ta 
2 2 15p* 45” 15 
1 jot es 5 | 




















Js(P) = 46080 I'(n—1) n n(n+2) n(n+2)(n+4) 
n+15 n—1 ; 
Klaiie c" r( 2 )ri 7) Ce | ee 15p 
J7\P)* 92160 T'(n—1) 7 nn  n(n+2)° n(n4+2)(n4+4)]’ 
+17) n— | 
CH CE) Lm 
hii is, 2 ) 2 iy _210p* 
10321920 I'(n-1) n n(n+2) 





420p> 105 
+ n(n+ 2)(n+4) n(n+ 2)(n + 4) (n + 6)]° 
Put p=0, then the sth moment about the mean is 
= (22) T(in+s—-t)T(s+l]) , (0) 
Ben n+2s+1 r n—1 J+ (9). 
ree ie) 


Putting s=2, 4, 6, 8 and substituting for f,(0), we can determine p,, and 
therefore the corresponding f’s. We easily find 
8 3(n+1) 
:= ? 
n—1 


_ 15(n+1)(v+3) 
enema Oe | 








’ 


ee 105 (n + 1) (n+ 3) (n +5) 
ok (n—1)8 : 


* fo(p), Fi(p)s Fo(p)s Fy (p) and f,(p) are worked out loc. cit.—the higher functions follow after some 


lengthy algebra. 
12—2 
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Hence from the equations of (0), p. 136, we find 
2 (n+ 1) (n+3)(n+ 15) _ ai 





Aga = 





n—1 . 
n? + 28n + 90 
mat+n=-—{-;— =-@-p) 
_ 194 22n? + 132n +300 _ 
i 2(n—1) at 


where g=|qi|, p=|p1|, m=|m|, g>p. Therefore g=—q, p=pr, m= mM. 
Similarly 
nie eee n® + 22n? + 132n + 300 
 g@tp. 2 4(Sn* + 128n* + 1140n? + 4968n + 7920) 
on simplification. The size of the sample, n, is > 0, and therefore my Type (i) curve 
must always be used. The equation is 








2\ k 
i 
Y= Yo wi? 
1+— 
P 
v v 
where u= hg Ja=1): 


Taking the total frequency to be 1000 in each case, I have calculated the 
ordinates and the frequencies for samples of size n =3, 5, 10, 25 and 50, for the 
Type VII curve, the normal curve, and my Type (i) curve. I compared the values 
found with the corresponding values of y= 1000 T7,,, as found from the tables, the 
ordinates by plotting, and the frequencies by using the x?-test for goodness of fit. 
Strictly speaking, we cannot use the y?-test in this case, because the test assumes 
an observed distribution, found in sampling. I used it because I know of no better 
test for comparing the frequencies. 

For the Type (i) curve, I found y by quadrature. I took 

yo’ =¢ , 9 (0) = 1000 T,, (0) 
‘ y 
to find the ordinates, and then found the correcting factor (yo’) which would reduce 
the area under the curve to 1000. Thus the true value for 


Yo = (¥Yo') x (yo) = (yo ) x 1000 T n-2 (0). 
ar 
The equations at the top of this page give the following values for the constants 


of the curves: 
TABLE XXV. 














n p q k’ Yo! “a=2 
3 2+3027 93°8027 2+3958 0°6995 50°00 
5 3°5653 67°3153 2*8834 0°9495 250°00 
10 6°7372 589594 4-0760 09864 145°51 
25 | 161536 75°1120 7°5273 0-9985 84-09 
50 | 31-6961 113°1247 131690 | 1:0004 57°83 
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Professor Pearson’s Type VII curve, viz. 


1 
Y= Yor 
3) 


lta 


r n+4 
N 2 
—_ Jo= Jr (n?— 1)} aa @=ni—l, l=t(a+4), 
> 








gives the following constants: 


TABLE XXVI. 


























a 
n a? 1 Y% 
3 8 3°5 331°45 
5 24 4°5 223°26 
0 99 7:0 141°81 
5 624 14°5 83°76 
90 2499 27°0 57°83 
1 v2 
ri . 1000 -;— 
The normal curve is 2n-1 
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I calculated the ordinates and the frequencies for each of the three types 
when n = 3, 5, 10, 25, 50. The ordinates when n=5 and n=10 are plotted in 
Diagrams XX and XXI. The values of P are approximately equal for the Type (i) 
and the Type VII curves, but the normal curve fits badly. The values of x? however 
show that the Type (i) can better describe the Bessel-function curve than the 
Type VII, and this is also shown by the diagrams. For samples of 10 the Type (i) 
curve effectively replaces the Bessel-function curve, for values of n >25 the Type VII 


and the Type (i) curves become identical. 
FREQUENCY CURVES for Pp, 


————BESSEL FUNCTION CURVE 
© TYPE (i) CURVE. 
e@ TYPE WI CURVE 

---- — NORMAL CURVE. 


n=5, N=tOoo, $=0. 


(_]=4 SAMPLES 




















n 
V*pyr aft 


Diagram XX. 


au 
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FREQUENCY CURVES for Py 
BESSEL CURVE. 
© TYPE (i) CURVE 
@ TYPE YI CURVE. 
@ TYPE (i) & TYPE WI CURVE AT SAME POINT 
--—-- NORMAL CURVE 
n=10, N=1000, 7-0. 


[| =15 SAMPLES. 






















KH 
$H 


n°? pA 


U*7-# &, 





~2 


Diagram XXI. 


(c) Distribution of ps in samples. 


Dr J. Pepper* drew samples of 10 from a normal paper with variance 100 
and calculated the 3rd moment coefficient of each sample. He found the following 
distribution: 


TABLE XXVIITfF. 




















Frequency 
Group |, — | Group 
+ -_ 
0—1 42 | 49 15- 
1—2 42 | 31 16— 
2—3 42 | 36 17-- 
3—4 27 | 21 18— 
4—5 26 | 28 19— 
5—6 28 | 17 20- 
6—7 15 | 17 21 
7—8 19 | 12 22 
8—9 15 | 15 23- 
9---10 | 11 2 24- 
10—11 9 | 16 25 
11—12 7 8 26— 
12—13 4 3 27— 
13—14 7 8 28 
14—15 |} 1l 8 29 














Frequency 





| 





—_ 


n 


Oa ee eK DK how eS Oe 
— | FPP -TDNWWNNHwWoa 




















Frequency Frequency Frequency 

Group |————_] Group | ————_] Group acti, 
# La +|- “iS a 
30— —j|— 45— | ~ | 1 60— — 
31— |—|—] 46— |} —|—] 61— 
ae EP 47— 62 
33— 1 1 48- 63- —_ 
34 2 1 49— 64 — 
35 2)— 50- | -- ~ 65— - — 
36 ie Oe, ie “hs Se eee 
s7— | 2 52- —{ 67- a, 
38— 1 - 53— - - 68— 
39 —|- 54 — 69-— — — 
40 —|— 55- - | — | 70—71 1 1 
41 ~ 56 
42 —| 3 57— i 2 ce is ae 
43— ] =f 58- a Frequency 
44— l a 59 — - _ | Total m 
en se ak 


















































* Tam greatly indebted to Dr Pepper for placing his sampling data at my disposal. 
+ Idenote by Frequency + and Frequency — the frequency in groups to the right and to the left 


of the origin respectively. 
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We want Sheppard’s correction formulae for the higher y’s. Now the general 
formula* is 


I? hs 
Yp= My + gyn P(P— 1) Mp2 t+ sro P (P—1)(P—2)(p—3) mpat-.., 


where vy,=pth raw moment coefficient about the mean and y,= corrected pth 
moment coefficient. 


Hence 
Ms = V5 — h? Sus 
Mg = vg —h?. Sy —h*. Se — qh, hi | 
My = v7 —h? Sys —h*. Sys E ivan ducedaeciend (15). 
8 
He= a ee 


Again, if v,’ denotes the uncorrected moment coefficient about the origin, then 
Vy = Vp — pv'p1Vy' + ete = v' p22 — 
Hence 
V5 = V5" — 5vg vy’ + Lvs’ vy’? — 10 v9" vy'3 + 4045, 
Vg = Ve. — Gus) vy! + 15 v4! vy'2 — 20v9' ry* + 15 v9" vy"4# — 5 vy", 
Vy = Vo! — Tye! vy! + 21 y5' vy? — B54’ vy'2 + B5y9' vy"4 — 21 9’ y> + 67, 
Vg = Vg. — Sv_/ vy" + Wy¢' vy"? — 5Gv5' vy'3 + T0v4' ry’* — 5G ' v5 + Wo" vy'* — Try8. 
The following constants were found: 
w= 1957, o =12334, 2, ="0102, 
B2= 7538, 84=139°04, 8,=3702°24, 
fitm=—57815=—(q—p), prigi= —40°208=—gp, m=87'980 =m; 
therefore p= "6873, g=5850, k’ =14864. 
The range is from # = — 94°3 to #@ = + 943. 
The Type (i) curve accordingly is 


a 1-4864 
page + eee 
’ | 8899°45 
FR ae ‘ 
(1+ ioas5 
I calculated ordinates at «=0—(‘125)—°*5, e=1'5—(2)-—95 and found the 
frequencies in the corresponding groups. The fit is very bad. The frequencies in 
the first twelve groups are: 


* Whittaker and Robinson, Calculus of Observations, p. 196. 
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Group Theoretical | Observed + Observed — 
o—-1 35°1 42 49 
1—2 34-2 42 31 
2—3 32°3 42 36 
3—4 29°8 27 21 
4—5 27°0 26 28 
5—6 24°1 28 7 
6—7 21°2 15 17 
7—8 18°5 19 12 
8—9 16-0 15 15 
9—10 13°8 1l 2 

10—1i 11°9 9 16 
11—12 10°3 7 8 




















Pepper also obtained the values for higher §’s of the distribution of the 3rd 
moment in samples*, viz. 


_ 3 (n? + 27n — 70) 











Ba= (n—-1)(n—2) 
A= 15 (n* + 84n® + 2695n? — 15,168n + 20,020) 
— (n—1)? (n— 2)? : .. (16). 
105 n® + 171n* + 13,893n4 + 580,401 n' — 5,131,014n? 
P . 4+ 14,132,268n — 12,932,920 
6=—— 





(n —1>(n—2) 


Hence substituting for Se, 8s, 8. in the equations giving piqi, pit qi and my, 
we obtain, after some very lengthy algebra, 


aqy = M482 — 9885 — 58281 
Pid = ~~ 5B? — Ba(6 — Bo) 








where A = n° + 348n7 + 50,975n* + 4,064,292n* + 34,629,558n4* 
— 646,758,090n* + 2,928,581,096n? 
— 5,507 ,413,800n + 3,806,202,400, 
and B=(n—1)(n— 2) (3n4 + 642n3 — 4879n? + 12,228n — 10,220), 


986 (3 — B2) + TBP - 508284 + 2582 
2 {5 82” — Ba .6 — Be! 





At+n= 


* Biometrika, Vol. xxtv. p. 60. 
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where C =10n’? + 3546n® + 521,736n5 + 34,738,968n!* 
— 392,792,064n® + 1,575,584,616n? 
— 2,789,485,552n + 1,854,410,880. 


— A 2 “ag 3 
Further my = 288 (5B2— 9) — 4982 + 2108284— 12582 
4 (682? — 6 — Be Ba) 
1{D 
= 36 4 It ENGla awascn Sockets es wee ue a bane neeaa eee eesenee (17 c), 
where D =n8 +393n7 + 66,122 n® + 6,216,894n® + 187,648,764 n! 


— 2,369,922,228 n° + 9,840,116,048 n? 

— 17,754,485,784n + 11,957,893,360. 
For all positive integral values of n the signs of pigi, qi + pi and m, are negative, 
negative, and positive respectively. Hence we find in all cases that the Type (i) curve 
has the same first eight moment coefficients as the actual curve of distribution 
of Bs- 


Let us regroup the data, in groups of 5 times Pepper’s grouping unit, and 
compare the distribution so obtained with the curves fitted from -the theoretical 
values and observed values of 82, 84 and fg. 


The regrouped distribution is 


TABLE XXVIII. 


















































- . 

Frequency Frequency Frequency Frequency | ssaiadees | 

| Group | ___| Group < Group | —Ssd}:«GGroup _] Group 4 

! 

| + - - | + a + - + - 

——$_—__—_ —— | mE. _—— — —_ 

O—5 | 179 | 165] 15—20] 26 | 26 | 30-35 | 4 | 3 | 45-50! — ! 1 | 60—65] — | — |] 
| 5—10 88 63 | 20—25 | 16 18 | 35—40 5 | — |50—55 | — | 1 65—70 | — -- 
|10—15 | 38] 43] 25-30] 7 | 10 | 40—45| 2 | 3 | 55—60| — | — 2 bt 8 

















We want to have the standard deviation of the curve calculated from the 
theoretical betas expressed in the same units as the standard deviation of the 
distribution above. 


6(n—1)(n-2 
Now o,,= ai ab — 2) pe’, 
n 
and yz was taken = 100 and the size of the sample is n=10. Hence 
6.9.8 a 
o,.. = 108 . 108 = 432,000 


in actual units. Now the interval between successive groups in the distribution is 
5o0 = 50 and therefore 
432000 __ 
== = 1728. 
2500 
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Put n= 10 in the Equations (17 a), (17 b) and (17 c) and we obtain 

q+ ps = — 55469, therefore p= 18823, 
qa Pr = — 1047°65, a q = 556°572, 
m, = 207689, = k’ = 2372. 

Hence the curve required is 

ae 2 
The regrouped data corresponds to the Type (i) curve, viz. 
1 — a x 00010442 \ 478 
1+ 2 x 010480 ) 


The following frequencies were found: 





y = 182°37 ( 


TABLE XXIX. 









































Theoretical* Observed Theoretical Observed 
Group Group 
(a) | () + 2 (2) (b) + 
O— 5 | 132°9 | 162°5| 179 | 165 | 50—55 *64 *70 |) — j 
5—10 | 96°4] 92°9| 88 63 | 55—60 *43 51 | 7 — — 
10—15 | 56°0| 43°9 38 43 | 60—65 *29 *35 | > — -- 
15—20 | 29°5| 21°5 26 26 | 65—70 *20 23 | | -— — 
20—25 | 14°7| 11:4 16 18 | 70—75 15 °15 1 1 
25—30 8*1 66 7 10 | 75 —80 *h) “09 |) — — 
30—35 4°5 4°0 } a 3 | 80—85 “08 "05 } — | — 
35—40 | 26| 2:5 5 — | 8—90| -06 03 |J — | - 
40—45 1°6 a } 2 3 x? 46°28 20°68 } ey 
45—50 1°0 Bs — 1 gi *00006 ‘08 7 














* The brackets show the grouping for the x? test. (a) and (b) refer respectively to the theoretical 
curve and the curve from the data. 


The poor fit when we fit the theoretical curve to the observed data is due to the 


large difference in the §’s which seems to suggest that the total number of samples, 
viz. 700, is far too small, e.g. 











Be By Be Calculated from 
7°538 139°04 3702 Data in Table XX VII 
7°640 150°23 4257 Data in Table XXVIII 

12°5 670°8 99225 Formulae (17 a—c) 

















Although P = 08 for the curve fitted from the data, this is not a bad fit, because 
the contribution to y? of the frequency in the group —5 to — 10, viz. 63 against a 
theoretical value of 92°9, is 9°6. Diagram XXII gives the regrouped distribution 
with the ordinates of the corresponding Type (i) and Type II curves. 
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(d) American Artillery Experience. 
1000 shots were fired from a battery gun in the United States at a target 
52 feet long and 11 feet high. All the shots hit the target, and their distribution 


in the eleven horizontal strips of one foot each, the point of aim being the central 
horizontal line of the target, was found to be: 





No of strip 1 2 3 A 5 6 


“1 
a 
© 





Frequency 1 4 10 89 190 212 204 193 79 16 2 



































10 ll | 
| 
{ 
| 








I found the mean to be a distance of -482 from the centre of the sixth strip. The 
constants referred to the mean and corrected by Sheppard’s formulae gave 





o = 15499, Ba = 10-9184, 
Bi = ‘0008, Be = 69°2137. 
Bs = 2°5261, 
DISTRIBUTION OF 1000 SHOTS FIRED FROM A BATTERY GUN, 
& 
TYPE (Vii) CURVE ° 
TYPE (XXi) CURVE - - —- Re 


TYPE TID CURVE O / : 
NORMAL CURVE © 



























































5 6 7 8 
STRIPS of TARGET. 
Diagram XXIII. 
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(i) The ordinates of the transition curves for Bz = 2°5261 and 8y= 10°9184 are 


do = 58-089, $e” = 68352, 
$2 = 33-191, os’ = 66-051, 
ds = 63'1822, oe” = 55'889. 


ge = 118-490, 
Since ¢,’ > observed ¢¢ > $e” the point corresponding to the observed §’s falls into 
a heterotypic zone. 
I found the ordinates of the Type (vii) curve for 
B2=2'526, By=10°918, A, =61°352, 
but although a better fit is obtained than that given by Pearson Type II curve, the 
fit is not good. I also tried to fit a sixth order Pearson curve, viz. 
m 

y=Yyoe rn (p-x/oP? 5 
which corresponds to the different equation 

1 dy _ — 4m (2/0?) 

ydz (p—x/a*) >" 





p is given by the cubic* 
p® (B2— 3) + 6p? Bs — 3p (784 — 582) + 986 — 782 Ba = 9 
and m by the equation p*—9p*+ 15p82.—78y=4m. 


Here again the fit is not good. Diagram XXIII shows the above distribution 
and the ordinates of the various curves plotted. 


Note. As a matter of interest I now considered the different types along 
the line 8g= 10°92 in the plane 8,= 2°53. The Table XXX below gives all the 
types discussed. Diagram XXIV (p. 190) shows some of the various shapes obtained. 


TABLE XXX. 























| Signs of 
Bs | Zonet Type 
| 
my |aiP, }atPr | (gi — Py)? | m°— (41 -Pa*| 
| P58 onl | ie 
30 below F,=0 | +]}/—|] + | + | + | (ii) 
33°19 »=0 3 en ee ee + (ix) 
50 (Fo, Fs”) +] - - | + j + (i) 
55°89 F,"” =0 +) - 1 ae | 0 xiii) 
57 (F”, Fy) +i _ | + | — (i) 
58°09 Fyu=0 +|/O]}] - - | _ (xi) 
60 (Fy, F,) +/+] - Pe) _ (iii) 
63°18 F,=0 S1et = + | - (viii) 
64 (Fy, Fs) | -|+ | - * | = (vi) 
66°05 F,=0 -—-/+] - - | 0 (xv) 
67 (Fy, Fe") oe IP. i - | + + (vi) 
68°36 F,’ =0 _ + | 0 + (vii) 
70 (Fe", Fe’) rds) | = - | - 4 (4) 
11849 Fj =0 a ae) ee Oe + | (vii) 
140 above #;,/=0 | — | + | - | + | + | (vi) 

















* In our case the cubic has only one real root. 


+ I denote for shortness by F',=0, on the curve F',=0 and by (F,,, F,)=0, between the curves F,=0 
and F,=0. H marks that the curve lies in a heterotypic region. 
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(e) Errors of the Right Ascensions of Stars*. 
Bessel provides the following data : 








Limits Actual Errors Limits Actual Errors 
0—0°1 114 0°5—0°6 6 

“1— 84 ‘6— 3 

*2— 53 ‘7 1 

-3— 24 *8— 1 

*4— 14 *9— 0 




















The table appears to add the frequencies in the positive and negative groups and 
to give their sums only. We find the constants 


B2= 3848, B,=2681, f_=252-2. 


KC =| 5B2° aes 
Hence = (A.- 6— Br Vile g, ~ Bs* ) = - 2672, 
_ (a. _ 25Ps*(B2+8)\ /{258:?(Be+8) 25|_ gee 
sala 56a /\ 36a ~~ 


Let us now interpolate linearly into the tables of the ®,’ ordinates for Bz = 3°843 
and r = — 2°672. 


¥ ’ 
Values of ®,’. 














r = 3°843 B.=4 

| ee | 

—2°0 — *352 - — — *382 | 
— 2°672 —*451 | —*482 — +488 

~3-0 _ +499 | ~ +540 | 








We find ®,’ = — -482, approximately. But the observed value of ®,’ is — 433 and 
therefore Bg > Px’ > do, since by inspection of the tables for 82=3'5 and 6,= 
$s’ > do. Accordingly, referring to Diagram I, Fig. ITI, we see that we must fit the 
Type (i) curve. Note that we could have reached this result by an inspection of the 
contour diagrams. 


Now (q—p) = —(q. + pr) = 19206, therefore q = 21:107, 
qp=—-Upr = 40°125, m p= 1901, 
m= im = 39°264, m k’ = 1°7065. 


* Astronomische Nachrichten, Bd. 15, Nr. 358—359. 
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aie 1 — u2/21-107\ 1785 

The Type (i) curve accordingly is y = yo Gas ay , Yo = 1240 and the 
range is from — 1:040 to + 1:040. 

Pearson’s Type VII and the normal curves are respectively 

y = 114°145 (1 + 2°140 22)058 
and y= 26499 ( oH e 4 (4°4165 — 
V Qe 


The ordinates are plotted in Diagram XXV. 


Using six groups the following values of y* were found and hence P by the 
goodness of fit test : 


Type of curve (i) Vil " Normal 
x? 669 1348 5517 
” ‘975 923 359 


Both the Type (i) and the Type VII give a better fit than the normal curve. 
There is not much to choose between the Type (i) and the Type VII curves. 


THE DISTRIBUTION OF THE RIGHT ASCENSION OF A STAR. 
BOO OBSERVATIONS MADE BY BESSEL. 





TYPE (i) CURVE —— 
TYPE WI CURVE O 
NORMAL CURVE @ 








SCALE OF FREQUENCY | 10 












































=2 re) 2 
SECONDS of TIME. 
Diagram XXV. 
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(f) W. Palin Elderton’s Data. 


The following example is taken from Elderton’s “Frequency Curves and 
Correlation” : 








] 


Central Value | Observations | Type II Type A | Edgeworth | Normal | 














—2°0 11 7 i a 16 | 
—15 116 109 | = 106 106 | 9 | 
—1-0 274 286 | 284 285 | 270 | 
- % 451 433 437 436 | 456 | 
+°5 | 432 | 433 437 | 436 | 456 | 
ae od eae | 25 | 3 | oat] 
EES | 116 109 | 106 106 | 95 
+2°0 16 Te eee oe eee 











There is no statement of what this distribution describes, and I have taken the 
groups as shown above. The constants are 


Hy’ =— "00149, we=1°829, 8, =°00237, B.=2°548, Byg=9122, and B.=39°50. 

The corresponding auxiliary co-ordinates are r= —°682 and @ =—-049. 

By inspection of the Diagram XV we see that the observed values of r and ® 
fall into the zone between the curves of section of Fy=0 and F,=0. Hence, from 
Diagram I, Type (i) must be used. We find 

p=4926, g=6321, hk =1330, y=47214 


a 1°3302 


The required curve is y= Yo 


Note the range is from — 3°76 to +376. We find the following frequencies : 


r serve ‘rr ency from 
Central Value Observed | Freque i ei 
Frequency Type (i) 


2-0 | 1] 14 
-1°5 | 116 i] 
—1°0 274 278 
a 3 451 438 
+ °5 | 432 4138 
+1:°0 | 267 | 278 
+1°5 | 116 111 
+2°0 | 16 14 





Although we have few groups the Type (i) curve gives a very good fit. All the 
other types, except the normal curve, fit equally well. Note that in this case the 
corresponding 7 and ® co-ordinates are r=—°68 and &=—-04, as compared to 
r=0 and ¢= "09 for Pearson’s Type IT curve. 
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(g) Distribution in Samples of the Mean of an Array as determined by the 
Regression Line (K. Pearson, Proc. Roy. Soc., A, Vol. 112, 1926). 


Let Ye M1, M2, 71, 2, p refer to parent populations, 
and Fx, ZY, X1, U2, 7 refer to the sample of size M. 
x G2 
Hence Yr= Mz + p—(x— mM), 
oj 
Zs 
and Yo=Y+r y (a —2Z). 


The moments of the distribution for 7,’ = Jz — Yr are 


oi ae 22 a 
p2= 3 , Where ¢?= 1 — 


3a! (1 — p?)* (48 2 (1 re 7) 


2 (#-—m,) 
ae ae 
i O1 





by 


va (M—-38)(M—5) |" Mf 


1502°(1 — p? PRS A 2\) 

¥6 ™ (M —3)(M—5)(M—7) hg -¥(2 BS u)? +5p(1 o my 
= 105098 (1 _ p”)* 

Ys = (if —3)(M—5)(M—7)(M-9) 


* 15(M-3) _ ( 1 6 cl 2\ 8 >: 2 1) 
ely ei ae Sit n*w() IESE 


a 105 (M—3, 
~ (M—5)(M—7)(M-9) 


3, 2\ 1 ei 28 aay 2) % 2) 
ae ee 1 ip) git an ' mu) et ap me (1 ~ on) ey 


(1) Take M=50. From the equations above we find the following table* : 


| | | | Ge 
c-m™ | By | By Be r Tv | io Py 
ese | | | | ef, 
| | | | | 
he, | 3051 | 15°806 119°339 | -024 | O | :3768 1874 | 
o; 37102 | 16°618 129°431 7016 | O | °2486 2051 
20; 3°128 17°045 | 136°792 003 | O | ‘2538 2146 | 
| | | | | 


ve Sone = | \ | 


* ry and ®,, are the r and & co-ordinates of the point corresponding to the Type VII curve. 
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(2) Take M=25. 





xz-—m, Be | Bs | Bg r "yu | > Pvn 


| 

tea 
ae 
| 





ho; 3°072 16°195 144°875 091 | 0 


“5460 "1946 
30, | 3-298 | 20-208 | 193-408 


"3326 | +2964 


g 
PD 
i) 





(3) For larger samples we are still closer to the point corresponding to Pearson’s 
Type VII curve. For very large samples the curve of distribution of the mean of 
the arrays follows a normal curve. 


We are so close to the point (0, Py;;) that in all cases the Type VII curve can 
excellently describe the distribution of the mean of the arrays. We expect this 
since 

3(M-—3) 
B= M—5 ” 
15(M-3) 
(M—5)(M-7)’ 


x 105(M—3, 

~ (M—5)(M-7)(M-9)’ 

58" 358.3 

6— Ps on ae (6 — Be) (9 -- 282) 


Ba= 
Be 


satisfy Ba = 


Conclusions. 

I have shown that the fourth order symmetrical Pearson curves give a better 
graduation than the second order Pearson curves or the normal curve, and that 
the improvement in fit, as judged by the x? test, justifies the extra amount of 
labour involved in finding the higher betas. 

I have also succeeded in finding frequency curves whose higher beta-coefticients 
are positive, in the cases By >6 and 45 < Ao< 6. 

Finally, I have discussed the Bz, 8a, 8g-space and the types of curve at different 
points, but the question of finding the type of curve corresponding to the hetero- 
typic area must be left unanswered. This problem demands an analysis of the 
sixth order Pearson curves and a discussion of 8, and Aj will be necessary. We 
cannot expect any advantage from such a system of curves. 


In conclusion I would like to thank Professor Karl Pearson for suggesting this 
problem to me and to express my appreciation of his continual advice and criticism. 
I wish also to thank Miss M. Kirby for the very capable way in which she has 
prepared my diagrams. 
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A BIOMETRIC STUDY OF THE “FLATNESS” OF THE 
FACIAL SKELETON IN MAN. 


By T. L. WOO, Pu.D., In consunctTION witH G. M. MORANT, D.Sc. 


(1) Introduction. It has long been recognised that differences in the degree of 
“flatness” of the face, or facial skeleton, are sufficiently great to be of considerable 
importance in aiding the discrimination of the racial varieties of modern man. These 
differences have usually been taken for granted, and it is only in recent years that any 
attempt has been made to give them quantitative expression. The research described 
in the present paper was undertaken with the purpose of extending these quantitative 
methods and applying them to more adequate cranial material than any previously 
considered. No one appears to have made any serious attempt to define exactly 
what is meant by the “flatness” of the facial skeleton, but this feature of the 
cranium has been customarily referred to when discussing measurements of two 
different kinds. Those of the first are chiefly angular measurements taken in the 
“median sagittal” plane and the more important of them deal with prognathism 
and are modifications of Camper’s facial angle. The greater the degree of progna- 
thism the less “flat” the face is said to be. Measurements of the projection of the 
“median sagittal” section of the nasal bones have been dealt with in a similar way 
by A. Liithy* and G. von Bonin+, the latter having obtained his data from sagittal 
type contours. We are not concerned in the present paper with such measure- 
ments which are confined to the “median sagittal” plane. Those of the second kind 
referred to are designed to give estimates, not of antero-posterior projection, but of 
transverse flattening. One may consider this feature at a number of different and 
convenient horizons and it is clear that a considerable number of measurements 
would be needed to give any appreciation of the transverse flattening of the whole 
facial skeleton. C. de Mérejkowsky seems to have been the first to make a useful 
contribution to this subject. In a paper which appeared in 1882} he described 
@ new instrument—now known as a simometer—designed to measure the projection 
of the nasal bones. The means found for a number of racial series of crania 
suggested that his method would be a valuable one, but it was not adopted by 
craniometricians. Thirty years late. a study of negro skulls was issued in 
Biometrika§ in which special attention was paid to the same character. The 

* “Die vertikale Gesichtsprofilierung und das Problem der Schiidelhorizontalen.” Archiv fiir 
Anthropologie, N.F. Bd. x1. 1912, pp. 1—-87. 

+ Biometrika, Vol. xx. 1931, pp. 260—261. 

} ‘Sur un nouveau caractére anthropologique.” Bulletins de la Société @ Anthropologie de Paris, 
diéme série, t. v. pp. 293—304. 

§ “A Study of the Negro Skull with special reference to the Congo and Gaboon Crania.” By 
R. Crewdson Benington. Prepared for press by Karl Pearson. Vol. vit. 1912, pp. 292—339; see 
pp. 315—320. 
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new measurements taken for this purpose, in addition to those defined by 
de Mérejkowsky, were the chord from dacryon to dacryon and the minimum are 
between the same points, so no special instrument was required other than a 
simometer. A paper published in the same journal in the following year* provided 
a detailed study in man and the anthropoid apes of the same measurements and 
also of the subtense from the dacryal chord, the simotic and dacryal subtenses 
being found with the simometer. Cranial series representing 16 races were dealt 
with, but several of these were admittedly too small to give reliable means. The 
dacryal and simotic measurements have been provided for the numerous cranial 
series described in later volumes of Biometrika, and the material available for them 
is now quite extensive. Their omission from all other recent craniometric studies 
may be largely attributed to the fact that the simometer is a somewhat complicated 
instrument which would need to be made to order. Other estimates of the trans- 
verse flattening of the facial skeleton—including in that the supraciliary region as 
Broca did—have been obtained from measiirements of horizontal type conteurs. The 
section represented is one through the glabella parallel to the Frankfort horizontal 
plane and indices measuring its curvature anterior to the temporal lines have been 
given by G. M. Morant for several racest. More recently Davidson Black has 
defined certain angular measurements to measure the same feature{. These are 





found with a goniometer or from the horizontal section on which the projections of 
certain facial points are marked: they are only available for Chinese series. 


The writers’ purpose was to make a fairly comprehensive survey of the degree 
of transverse flattening of the facial skeleton considered as a whole in a sample of 
races from all parts of the world. The use of any projective methods would have 
made it impossible to record a sufficiently large number of measurements in the 
time available, and accordingly they decided to use a simple pair of co-ordinate 
calipers only. Four pairs of measurements, defined in the next Section, were taken 
to give four indices, of which all but one are new. The simotic index, which was 
included, was first defined by de Mérejkowsky, and a considerable amount of 
material for it has accumulated since. Ryley, Bell and Pearson concluded that, for 
purposes of racial comparison, “a measure of the mesodacryal index is likely to be 
of as much value as the determination of the simotic index, possibly of greater 
value,” but nevertheless we chose the latter rather than the former to measure the 
prominence of the nasal bridge because it can be determined on a far larger number 
of museum specimens. The lacrymal bones are frequently missing, and it is then 
impossible to locate the dacrya with precision. 


(2) Definitions of Measurements. The object of the measurements used in this 
study was to provide accurate estimates of the transverse flattening of the facial 
skeleton at different horizons. This was effected by measuring directly, with the 


* “A Study of the Nasal Bridge in the Anthropoid Apes and its Relationship to the Nasal Bridge in 
Man.” By Kathleen V. Ryley and Julia Bell, assisted by Karl Pearson. Vol. rx. 1913, pp. 391—445. 

+ Annals of Eugenics, Vol. 1. 1927, pp. 355—356. 

+ Palaeontologia Sinica, Series D, Vol. vit. 1928, pp. 138—14, 
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aid of a special pair of calipers, the subtense of a particular “median” point from 
a corresponding chord, of which the terminals are the same point on the right and 
left sides, and also the length of the chord. The index obtained by expressing the 
subtense as a percentage of the chord provides a measure of the character with 
which we are concerned, and four different indices were found, whenever possible, for 
each skull. The readings required could be taken rapidly and accurately with a 
pair of co-ordinate calipers. This instrument consists of a bar, on which a scale is 
inscribed, with three arms which always remain mutually parallel to one another 
and perpendicular to the bar such that their tips, which are brought into contact with 
the object measured, are always co-planar with the scale. One of the arms is fixed 
to the zero end of the bar, and another of the same length can be moved along it 
so that these two are used in precisely the same way as the arms of an ordinary 
pair of rectangular calipers. The third is attached between the other two, and it 
can also be moved along the bar, the attachment permitting a movement so that 
the setting point of the arm can be shifted towards or away from the bar. This 
middle arm bears a scale on which the reading is zero when the tips of the three 
arms are in a straight line. When the middle arm is moved towards or away from 
the bar the divergence of its tip from that straight line—i.e. the subtense—can be 
read. In our case the tips of the outside arms were first brought into contact with 
the extremities of the transverse chord, and then the middle arm was moved until 
its tip was in contact with the appropriate “median” point, a screw enabling it to 
be fixed in that position. On removing the calipers both the length of the chord 
and the subtense of the “median” point from it were recorded. Another reading 
might have been taken at the same time. This is the distance of the foot of the 
perpendicular—i.e. the “working edge” of the middle arm—from one or other 
extremity of the chord which would have given an appreciation of asymmetry. The 
measurement in question was not recorded, but for two series the direct distanve of 
the “median” point from the extremity of the chord was found on either side. The 
form of co-ordinate calipers made by P. Hermann, Richenbach und Sohn, of 
Ziirich, was found to be unsuitable for taking the measurements devised for two 
reasons: the middle arm is too broad to make it possible to obtain the adjustment 
required in the case of narrow nasal bones, and the outside arms are not long 
enough to enable other subtenses as large as some needed to be found. Accordingly 
a similar instrument was made to order for Dr Woo by Messrs W. F. Stanley and Co. 
The breadth of the middle arm was reduced to 2°5 mm., and the outside arms were 
made 10 cm. long, All the measurements needed could be readily taken with these 
calipers on all the skulls examined. 


The measurements are: 


(1) ZOW. The internal bi-orbital breadth between the points, right and left, 
where the fronto-malar sutures cross the outer margins of the orbits. The point 
used, which is Martin’s fronto-malare orbitale, can almost always be found precisely 
if the suture be open. If the orbital margin be blunt, it is advisable to mark it first 
as a pencil line. The measurement is No. 43 (1) in Martin’s list. 
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(la) Sub. TOW. The subtense of the nasion from the chord TOW. 
(1b) The frontal index of facia] flatness is defined to be 100 Sub. JOW/IOW. 


(2) SC. The simotic chord which is the minimum horizontal breadth of the 
two nasal bones, so that the extremities of this chord are points on the naso- 
maxillary sutures. 


(2a) SS. The simotic subtense found by first marking as a pencil line the 
“ridge” of the nasal bones, without necessarily following the inter-nasal suture, and 
then finding the minimum subtense from this line to the simotic chord. When 
found with co-ordinate calipers this may not correspond exactly with the simotic 
subtense found with the aid of de Mérejkowsky’s simometer. A comparison between 
the two measurements found for the same series is made below. 


(2 6) The simotic index is defined to be 100 SS/SC. 


(3) MOW. The mid-orbital breadth between the points, right and left, where 
the malar-maxillary sutures cross the lower margins of the orbits. It is generally 
advisable to mark the lower margins of the orbits as pencil lines before locating the 
points. 

(8a) Sub. MOW. The subtense from the “tip” of the nasal bones, which is 
accepted to be the lowest point on the inter-nasal suture (i.e. Martin’s rhinion), to 
the chord MOW. This was found to be an unsatisfactory measurement owing 
to the fact that the lower parts of the nasal bones are often defective, even on 
skulls which have the remainder of the facial skeleton quite complete. It is some- 
times difficult to decide whether the region in question is intact or not, and in all 
doubtful cases the measurement was omitted. 

(3b) The rhinal index is defined to be 100 Sub. MO W/MOW. 

(4) GB. The chord between the points, right and left, on the zygomatico- 
maxillary sutures which are lowest with regard to the Frankfort horizontal plane. 
The positions of these points can be estimated with sufficient accuracy without 
orientating the cranium exactly. If the inferior extremity of the suture is a short 
length lying parallel to the horizontal plane, the anterior point on it is the one 
accepted. The point used is Martin’s Zygomasillare and the chord (No. 46 in his 
list) is one usually given in the descriptions of racial series. 

(4a) Sub. GB. The subtense of the alveolar point from GB. The true alveolar 
point was used, i.e. the inferior point on the process between the cavities of the 
central incisors, and not the prosthion. 

(4b) The premaxillary index of facial flatness is defined to be 100 Sub. GB/GB. 

The first three of these indices are of a straightforward character, and in 
each case the three points involved usually lie in a plane which is approximately 
parallel to the Frankfort horizontal plane. For the first the points all lie on 
the inferior borders of processes of the frontal bone; the second measures the 
flatness of the bridge of the nose and it is confined to the nasal bones, while 
the third is of a less simple type since it involves the nasal and maxillary bones. 
The points used in determining the fourth index all lie on the premaxillary bones, 
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but the “median” point—the alveolar point—is always decidedly inferior to the 
extremities of the transverse chord. The subtense will obviously be affected by 
differences in the height of the premaxillary region, though the index may be 
supposed to give primarily a measurement of sub-nasal prognathism. We should 
anticipate that the latter would be highly correlated with a measurement of the 
total prognathism of the face. 


Of the absolute measurements defined above, TOW, SS, SC and GB have been 
provided for a number of cranial series, and all except the first were previously 
available for several of the series measured for the purpose of this study. Hence, 
comparisons can be made to test the reliability of the same measurement taken on 
the same specimens by two different observers, and this is particularly necessary 
in the case of the simotic subtense as two different instruments, and slightly 
different definitions have been used to determine it. De Mérejkowsky describes 
how he measured the character in question with his simometer. The two arms 
of the instrument are placed so that their tips coincide with the extremities of the 
minimum breadth of the two nasal bones, and then, to bring the tip of the sub- 
tense arm into position, “on l’appuie sur le dos du nez la ot celui-ci atteint son 
minimum d’élévation ; les trois points de repaire (sic!) sont donc faciles & trouver 
et complétement fixes*.” But the last statement does not seem to be true since 
“le dos du nez” is an inexact term and yet it forms an essential part of the 
definition, as it is only with reference to it that the minimum d’élévation can 
have any meaning. We must suppose that de Mérejkowsky was imagining a 
“median” line, which might be traced in pencil on the nasal bones, such that it 
would indicate their “ridge,” or the line marking the join of the right and left 
sides of the surface formed by the two nasal bones together. On a symmetrical 
cranium this would coincide with the inter-nasal suture, but the direction of the 
suture was not considered, presumably, in determining the “ridge” if the nasal 
bones were asymmetrical. The subtense would then be the minimum from this 
line to the simotic chord, and having fixed its position it would also be the 
maximum subtense of the section of the nasal bones defined by the three points 
of contact of the instrument. This method of measuring the simotic subtense 
appears to have been that followed by Ryley, Bell and Pearson, and by later workers 
in the Biometric Laboratory who used the simometer. Dr Woo used co-ordinate 
calipers in place of that instrument, but followed the same convention in making 
the point of subtense of the subtense arm one on the “ median” ridge of the nasal 
bridge. When the compass arms of the simometer are fixed, as to the simotic 
chord of a particular cranium, the pivot of the instrument, which is equidistant 
from their tips, is also a fixed point. The middle, or subtense, arm is then free 
to rotate round that pivot in such a way that the tips of the three arms and the 
pivot are always co-planar. It is adjusted so that its tip touches the point on 
the accepted transverse section of the nasal bones which is nearest to the pivot. 
The simometer is then removed from the cranium and, while the direction of the 


* Loe. cit. p. 298. 
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subtense arm relative to the compass arms is maintained, its tip is moved until 
it is in line with the tips of the compass arms. The subtense measured is the 
displacement of the tip of the subtense arm*. If the section in which the simometer 
measurements are taken is asymmetrical, the subtense may not be perpendicular 
to the chord and the point of maximum subtense may not coincide with that found 
by using the co-ordinate calipers. It is clear that the simometer subtense will 
always be equal to or greater than the subtense found with co-ordinate calipers, 
and we must ask whether the readings found with the two different instruments 
are sufficiently close to be considered comparable for practical purposes. To decide 
the point Dr Woo found the nasal bridge measurements with co-ordinate calipers 
in the case of a series—the Kerma Egyptian measured by Miss Collett—for which 
the measurements had previously been recorded by using a simometer. Com- 
parisons are made in the following table, the constants relating to differences 
having been found from the distributions of differences : 





Character 
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| Collett with simometer 


Woo with co-ordinate calipers 
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* The simometer is rather a complicated instrument and the reader who is not acquainted with it 
may find it difficult to follow our remarks without refering to de Mérejkowsky’s description and figure 
in the paper cited, 
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The mean difference between the male simotic subtenses (SS) is seen to be 
small but quite significant, the mean found by using the simometer being greater 
than that given by the co-ordinate calipers. The difference between the female 
means, however, is of the opposite sign though quite insignificant. But some 
significant differences are also found for the simotic chords (SC) and _ facial 
breadths (GB), though in these cases the two observers followed precisely the 
same definitions. 

For the male simotic index the difference is actually insignificant, while for 
the female it is quite significant but of the opposite sign to that which would 
have been expected. We may conclude that the differences between the two 
methods of determining the simotic subtense only introduce errors which are 
of the same order as those due to the personal equations of two different workers 
following the same definition*, and accordingly we have supposed that the simotic 
measurements found with the co-ordinate calipers may be compared without cor- 
rection with those previously found with the simometer. For all the series dealt 
with (except the Kerma Egyptian) which had any of the measurements JO W, SS, 
SC, 100 SS/SC and GB provided in previously published papers, those values were 
accepted to give the constants provided in the present paper. It may be observed 
that none of the differences shown in the table on p. 201 is large enough to vitiate 
racial comparisons, except possibly the differences for the simotic chord, since they 
are very small compared with interracial differences. For the simotic index, for 
example, a difference of about one unit is shewn between the means found by the 
different observers for the same series, but the range of the mean male simotic 
indices for different series is from 20°7 to 53:1. 


In the next place, we may consider the few measurements collected for the 
purpose of examining asymmetry. For two male series the direct distances of each 
median point from the extremities of the corresponding chord were recorded and 
a comparison of these bilateral measurements is made in the table on p. 203. 


Only one of the differences of means can be considered significant and if there 
be any differentiation of the bilateral measurements considered it is evidently of a 
small order which would only be revealed by examining larger series. 


(3) Description of the Material Measured. The object of the present paper is to 
test the racial significance of the new measurements of facial flatness defined in the 
previous section. In order to serve this purpose it was necessary to measure as large 
numbers as possible of crania representing as many races as possible from all parts 
of the world. The observer (T.L.W.) was obliged to limit his material to that con- 
tained in the principal English museums and one Dutch museum. He is greatly 
indebted to the curators of these institutions for affording him every facility for 


* In the paper by Ryley, Bell and Pearson (loc. cit. pp. 396—398) comparisons are made between 
various mean simotic measurements found by the same observer who repeated them on the same series 
of crania at different times. The largest differences shown are 0°10 for SS, 0:36 for SC and 1-0 for 
100SS/SC. These are of the same order as, but rather less than, the largest differences shown in the 
table above for the corresponding measurements. 
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pursuing this study and particularly to Miss M. L. Tildesley (Royal College of 
Surgeons), Dr L. H. Dudley Buxton (Oxford), Dr W. L. H. Duckworth (Cambridge), 
Mr W. P. Pycraft (British Museum), Professor J. A. J. Barge (Leiden) and, finally, to 
Professor Karl Pearson (Biometric Laboratory). The number of adult crania actually — 
measured was 4266 male, 1630 female, the sexes accepted being in general those 
provided in previously published papers or museum catalogues, though in the case of 
a few series the writers had to assign sexes to the specimens themselves. No imma- 
t ture specimens were dealt with. Whenever possible long series representing single 
racial types were chosen and a number of earlier studies—nearly all of which have 
appeared in Biometrika—dealing with their characters were consulted as a guide to 
the groups which could be considered racially homogeneous. In the case of a number 
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of races, however, it was only possible to arrive at means based on sufficiently large 
numbers by pooling the measurements of short series in different museums. Or 
means of larger series might be pooled if there was sufficient reason to believe that 
they represented the same racial type. The test used in such cases was based only 
on the four indices of facial flatness. If two or more series were thought, from the 
description of their origins, to represent the same racial type, then their mean indices 
were calculated and if no significant differences were found between them the series 
were pooled to give the constants finally used. This practice is only of provisional 
value, of course. Two samples of skulls may show one character, or group of 


* The probable error of the difference of :neans is given by the formula 


67449/ Jnx Jon? + oz? —2rRioReL- 


+ This approximate probable error was found by using the o’s and r given for the Punjabi series. 
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characters, identically the same while differing most significantly in other ways. 
Several of the series finally used are prebably less homogeneous racially than any 
which could legitimately be used for purposes of rigid racial classification, but they 
may still be of value in a preliminary study such as the present. All the continental 
areas are represented, but the races used do not form a random sample from all 
races of the world. There are several British series, but few long series from other 
parts of Europe; the races of Africa and Oceania are fairly well represented; the 
American and Western Asiatic series are particularly meagre, while there are none 
from Central Asia and Siberia, but India and the Orient are better represented 
than any other regions. The means are given of a few series which are so short that 
no use can be made of them. It may be possible to pool these with measurements 
collected later from other sources. 


The museums in which the cranial specimens were measured are: The Museum 
of the Royal College of Surgeons, London (R.C.S.); the British Museum, Natural 
History (B.M.); the Biometric Laboratory, University College, London (Biom. Lab.); 
the University Anatomical Museum at Cambridge (Cambridge); the Oxford Uni- 
versity Museum (Oxford); and the Museum of the “ Anatomisches Laboratorium,” 
Leiden, Holland (Leiden). The abbreviations given in the brackets are used to 
denote them in this paper. Bm. denotes this Journal and J.A.I. the Journal of the 
Anthropological Institute. 


A. EUROPEAN SERIES. 


I. English: Long Barrow. Cambridge 18 ¥. The skulls in Thurnam’s collection 
came chiefly from different localities in Wiltshire. They are of the Neolithic period. 


II. Romano-British. Cambridge 11 /,3 3; Oxford 81 ¢,43 2. The specimens 
at Cambridge came from a singie cemetery at White Horse Hill, Berkshire, and 
those at Oxford came from a number of counties, mainly from Oxfordshire, Berkshire 
and Wiltshire. 


III. Anglo-Saxon. (a) Cambridge 13 ¥,8 2; Oxford 41 ¥,19 2. These skulls 
came from a considerable number of cemeteries in different parts of England. 
(b) Cambridge 42 ¥, 20 $, only the complete skulls being measured. This series 
was recently excavated by Mr Lethbridge at Burwell, Cambs., from a cemetery 
of seventh century date. In order to justify the pooling of the measurements of the 
above two series the male means and standard deviations for the four indices dealt 
with were calculated separately so that the significance of the differences of the 
means for these characters between the two series could be satisfactorily tested. 
The values are given in the Table at the top of p. 205. 

It is evident that all the differences between the means are quite insignificant. 
This justifies the provisional pooling of the two series and the combined means, 
which have been used for comparative purposes in this paper, are given in Table I. 


IV. Hnglish: Mediaeval. Oxford 43 ~, 11 2. These crania came chiefly from 
Abingdon, Berkshire, and Sittingbourne, Kent. A few unspecified specimens are 
also included. 
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Frontal Simotic 
Indices 
Mean | o Mean | o 
Series (a) 18-6+°19 (52) | 2074-14 | 52°341-06 (44) | 10-414-75 
Series (b) 18°5+°22 (40) | 2°064°16 | 524+ -87(33) | 7°544-63 
(a) —(b) +0°1+°29 = | —0-1+1-38 | oe 
| | | 
Rhinal Premaxillary 
eS tat BRAN INS eS. 
Series (a) | 45°44 °85(16)| 5°044°60 | 35°14°36(42) | 3°50+-26 
Series (6) | 45°2+1°39 (6)* | = | 35°14°47 (34) | 4°10+°34 
(a) - (0) | +0°2+1°63 — 0-0+ “60 _ 











V. English: Farringdon Street. Biom. Lab. 81 /, 76 $. This series came from 
a London churchyard used in the seventeenth century. See Bm. Vol. Xv1t. pp. 1—55, 
1926. 

VI. English: Spitalfields. Biom. Lab. 248 #, 94 2. This series came from a 
burial-ground at Spitalfields Market, London, and it is of mediaeval or Roman date. 
See Bm. Vol. Xxul. pp. 191—248, 1931. 

VII. Modern Irish. R.C.S. 19, Cambridge 8, Oxford 38 (soldiers): all ¢. 
these only 15 are from known localities of the Irish Free State. The majority of 
the crania, including 38 soldiers and 5 seamen, are from unknown localities in Ireland. 

VIII. Modern Scottish. R.C.S. 19, Cambridge 3, Oxford 13 (soldiers): all ¥. 
Twenty skulls came from known localities of Scotland and others are unspecified. 

IX. British (Soldiers). (a) English soldiers: Oxford 31. Their native places 
are unknown. (6) British soldiers: Oxford 82. No detailed information relating to 
the origin of these specimens is available. 




















} 
Frontal | Simotic 
| Indices | = —— - 
| Mean | o | Mean | o | 
Series (@) |} 19°1%°29 (31) | 2°35+°20 | 51°5+1°65 (30) | 13°36 + 1°16 
Series (b) | 1894-18 (82) | 2464-13 | 53°14 -97 (81) | 12°84+ -68 
(a) —(b) | 4-24-34 | —1°64+1°91 Ss 
| ‘ 
| 
3 | Rhinal Premaxillary 
! 
ane 
| Series (a) | 43°2+ °90(19)| 5°82+°64 35°6 + °32 (27) 2°45 + °23 
| Series (6) 44-74 -49(65)| 5°894°35 36°34°24 (74) | 3°03+4°17 
| (a) =(b) ~1°5+ 1-03 me —"T£°36 _— 
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*|The probable error of the mean of the rhinal index for the (b) series is based on the standard 
deviation of the same index for the (a) series, 
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The mean measurements of the above two series have been pooled to give those 
in Table I (p. 222). It is evident from the figures shown above that the mean 
ditferences for the four indices between the two series are too small to be significant. 
It is, therefore, legitimate to pool them. 


The last three series VII—IX above are probably less homogeneous racially 
than any other British series used. The means derived from them can only be 
used provisionally. 


X. North German. R.C.S. 28, Cambridge 4: all ~. The specimens came 
principally from different provinces of Prussia—Brandenburg, Holstein, Westphalia, 
Hessen and Saxony being represented. 


XI. French. R.C.S. 46 ¥, 19 3. These skulls came mainly from the catacombs 
of Paris, though a few are from unknown localities. They belong to the post- 
Merovingian period. 


XII. Jtalian. R.C.S. 102 /, 33 $; Cambridge 1 2. These specimens, which 
previously belonged to G. Nicolucci, came from 12 divisions in the northern and 
central parts of Italy. A few from the islands of Sicily and Sardinia were excluded. 
They are all of modern date. 


XIII. Norwegian. R.C.S. 19 ~. These specimens came from a Seamen’s 
Hospital at Greenwich, so they are presumably the skulls of seamen. One individual 


is stated to have come from Bergen, but the home districts of the others are not 
known. 


XIV. Swedish. R.C.S. 30 ¥. Of these, 9 crania came from the neighbourhood 
of Stockholm, and the others are only marked as seamen, 


XV. Finnish. R.CS, 22 ¥, Cambridge 1°. Ten of these crania came from 
various localities of Finland, and 13 from a Seamen’s Hospital collection are recorded 
as Russian Finns. As the series is a short one, no useful comparisons could be made 
between the two groups. 


XVI. Dutch. R.CS. 29,10 2; Leiden 22 /. Two-thirds of these specimens 
came from different known provinces of Holland, viz. Guelderland, Utrecht, 
Friesland, Groningen, Zeeland and North Holland: the others are unspecified. 


XVII. Ancient Greek. R.CS. 3 ¥, Oxford 9 ~. These skulls came from 
Athens, Attica and the Ionian Islands, and they are dated approximately between 
500 and 200 B.c. 


XVIII. Modern Greek. R.C.S. 18 ~, Oxford 15 ¢. The localities from which 
these specimens came are: 8 from the Greek mainland, 12 from the Ionian Islands, 
4 from the Aegean Islands, and 9 from unknown districts of Greece. 


XIX. Czech. R.C.S.23 ¢, 102. These specimens came from Bohemia, Slovakia 


and Moravia. The central part of Moravia is better represented than the other 
regions, so the series may be loosely called a Czech one. 
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XX. Russian. R.CS. 20 #, Leiden 16%. Of these specimens, 22 came from 
various scattered localities in Russia, and others are marked as “unknown” in 
locality. 


European races, apart from British, are much less satisfactorily represented — 
than are those of the other continental areas except America. This is chiefly due 
to the fact that there are few European skulls other than British in English 
museums. The pooling of several short series from various localities of the same 
country had to be effected in several cases in order to obtain sufficiently large 
samples. 


B. AFRICAN SERIES. 
I. Guanche. R.C.S. 16 ¥, 19 2; Cambridge 5 f', 49; Leiden 47,5 3. Nearly 


all these specimens came from Teneriffe. 


II. Nubian. Biom. Lab. 69 ¥*, 61 ¢. This series was excavated by the 
Archaeological Survey of Nubia in 1907—1908 from sites immediately south of 
Assuan. It consists only of New Empire (17th—20th Dynasty) crania. 


Ill. Egyptian: Badari. Biom. Lab. 36 ¥, 22 2. See Bm. Vol. xix. pp. 110— 
150, 1927. This series is believed to be the earliest of predynastic date that has 
yet been discovered in Egypt. It has been shown that the earlier Egyptian series 
bear a closer resemblance to negro types than do the later series from the same 
country. 


IV. Egyptian: Sedment. Biom. Lab. 40 “, 30 2. See Bm. Vol. xxu. pp. 65— 
92,1930. This series came from a cemetery of 9th Dynasty date at Gebel Sedment, 
which is about 70 miles South of Cairo. 


V. Egyptian: Kerma. Biom. Lab. 117 ¥, 97 2. These skulls represent the 
Egyptian population of Kerma (Nubia) in the 12th and 13th Dynasties. See Bm. 
Vol. XXV. pp- 954—284, 1933. 


VI. Negro: Nigeria. (a) Crania from North Nigeria. R.CS. 64, 49; 
Cambridge 13 /, 3 $. These skulls came from various parts of North Nigeria. 
(b) Crania from South Nigeria. R.C.S. 34 ¥, 12 2. These skulls came from several 
provinces of the country, and the majority of them are recorded as representing the 
Ibibio and Ekoi tribes of the Calabar region, The male means of these two series 
for the four indices were calculated separately and are shown below. The probable 
errors of the North Nigeria means were found by using the South Nigeria standard 
deviations. 





| ! 
Indices | Frontal Simotic Rhinal Premaxillary 
van | oe 2 
(a) N. Nigeria (Mean) | 17°7 +°29 (18) | 30°7 41°68 (17) | 30°2 41°16 (11) | 36°24 “56 (17) | 
(6) S. Nigeria (Mean) | 17°6 +°21 (34) | 28°9 + 1°22 (32) | 29°5 41°11 (12) | 36°5 + -43 (29) 
Ss. Nigeria (oa) 1°824+°15 10°24+ “86 570+ “79 34°24 °30 
(a) —(b) +°l +:°36 +1°8 +2°08 +'7 +1°61 —3+°71 
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It is obvious that none of the differences between the corresponding pairs of 
indices should be regarded as significant. The pooled means in Table I (p. 222) 


were used for comparative purposes. 


VII. Gaboon Negro. (a) Gaboon, 1864. B.M. 504, 44 2. See Bm. Vol. vi. 
pp. 292—339, 1912. This series was collected by Du Chaillu in Fernand Vaz. 
(b) Gaboon, 1880. B.M. 18 ¢, 19. See ibid. This series came from the same 
locality, and it was coilected by the same person. It has been shown that series (a) 
and (b) bear a very close resemblance to one another when all the usual cranial 
characters are compared. (c) Gaboon, collected at various times. R.C.S. 12 4", 6 2. 
These specimens came from the neighbourhood of Fernand Vaz and the Gaboon 
river. The mexns of the above three male series for the four indices are shown in 
the following table. 





Indices Frontal Simotic Rhinal Premaxillary 














Gaboon, 1864 (Mean) 17° +°21 (46) | 30°9 + °91 (45) | 28°9 + °41 (32) | 38°5 +°37 ( 
Gaboon, 1880 (Mean) 17°5 +°34*(18) | 29°0 +1°52 (16) | 29°2 + -64(13) | 38°5 +°64 (13) 
Gaboon, R.C.S. (Mean) | 18°3 +°42*(12) | 28-3 +1°92(10) | 28°7 +1°16 (4) | 39°3 +°73 (10) 
Gaboon, 1864 (c) | 2°134°15 901+ °64 3-434 -29 3°42 +°26 

(a) - (b) 0+°40 +19 +1°77 -—"3 + °76 O+°74 

(a) —(c) | —*8 +°47 +2°6 +2°12 4+°2 41°23 — "8+ 82 

(b) —(e) | —"8 +°54 + °7 42°45 +°5 *+1°32 —'"8+:97 


























It is evident that the ditferences between all possible pairs of means are quite 
insignificant, and the combined means given in Table I may therefore justifiably be 
employed for comparison in this paper. 

VIII. Angoni. B.M. 25%. See J.AJ. Vol. xxvui. pp. 55—94, 1898. The 
specimens were obtained from M’ponda’s Town at the south end of Lake Nyassa. 
The donor, Sir H. H. Johnston, describes them as being the crania “of a slightly 
mixed negro race mainly belonging to the Anyanja stock with a slight Zulu inter- 
mixture. It is possible, however, that one or more of them may be Yaos mixed with 


Arab blood.” 
IX. Ashanti. R.C.S. 12 f; Cambridge 2~; Oxford 20 f, 31 2. Of these, 


4 male specimens came from Kumasi, and 3 males were of Ashanti warriors 
killed in war in 1873—74. Others are merely marked as having been obtained 
from districts on the West coast of Africa, and no further information is available. 

X. Teita Negro. Biom. Lab. 57 ~, 65 2. See Bm. Vol. xxi. pp. 271—314, 
1931. The Teita, or Wa-Teita, are a small tribe of Bantu-speaking negroes living 
in the South of Kenya Colony. 

XI. Zulu. R.CS. 17 ~. Most of these skulls are of Zulu warriors. They came 
from two battle-fields on the Zulu border, Isandhlwana and Ulundi, where fighting 
took place in 1856 and 1879 respectively. 


* The probable errors of the four indices for series (b) and (c) were obtained by using the corre- 
sponding standard deviations of series (a). 
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XII. Congo Negro. BR.C.S. 58 ¥, 30 $; Leiden 2 ¥,1 2%. See Bm. Vol. vim. 
pp. 292—-339, 1912. The majority of these specimens represent the Batetela tribe 
who live at about 24° 20’ E., 4° 51’S. near the Lubefu River. A few came from the 
neighbourhood of the Congo basin and the Upper Congo regions. 


XIII. Bushman. R.C.S. 9,14 3 ; Cambridge 27; B.M.1/; Oxford 11,3 9. 
Of these, there are 4 males and 1 female from the Cape of Good Hope, 1 male from 
the Transvaal, 1 male from Namaqualand, and 1 female from the Orange Free 
State, while the localities from which the others came are unknown. 


XIV. Hottentot. R.CS.2¢/, 62; Oxford 7/, 29. Of these, 3 males and 
1 female are known to have come from Namaqualand, Middleburg, the Cape of 
Good Hope and the Cape of St Francis respectively, while the others are from 
unknown localities. 


XV. Kafir. RCS. 22¢, 73; Cambridge 5f, 29; Oxford 257, 12; 
B.M. 3 . Various tribes of Kaffirs are represented in this series, but most of the 
specimens represent Bantu negroes inhabiting the South and South-East of Africa. 
Six male specimens are stated to have been obtained from native burial-grounds 
during the Kaffir War in 1847. 


C. AMERICAN SERIES. 


I. Eskimo. R.C.S.33¢,173; Cambridge 47,19; Oxford 11/,112; BM.4¢, 
12. Of these, 31 males and 18 females are marked as having come from various 
parts of Greenland and the neighbouring islands; 12 males and 8 females are from 
several islands situated to the North of Baffin Land and from the North-East coast 
of Labrador. A few came from unknown localities. All the specimens have been 
pooled together to make up a sufficient number for the purpose of statistical enquiry. 


II. Peruvian. This is one of longest series which was measured for the present 
purpose. In order to examine locul differences in the characters dealt with, the 
material may be first divided into two groups: (a) Specimens from Pacasmayo. 
R.C.S. 20 f, 139; Cambridge 47,7, 222; Oxford 4, 42. These mostly belong to 
Consul Hutchinson’s Collection. (6) Specimens from other localities. R.C.S. 377, 
282; Cambridge 10 ¥, 53; Oxford 55 ¥', 422. These were mostly purchased from 
Consul Hutchinson. The localities from which the majority of these skulls came 
are: Cerro del Oro (in the Canete Valley, 100 miles South-East of Callao), Ancon, 
Huacho (near Callao) and Santo, all situated in the Western part of Peru. The first 
two named are better represented than the others. The means of the four indices 
for the two male series are given in the table at the top of p. 210. 


From the figures shown it is apparent that the mean differences are all too 
smal] to be considered significant except in the case of the simotic index for which 
the difference is 2°6 times its probable error. It was considered that for the present 
purpose all these Peruvians might be pooled together although a significant 
difference occurs in the second index. It should be noted here that among the 
Peruvian skulls there are many which show a fronto-occipital deformation. It may 
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Indices Frontal Simotic 
Constants Mean o Mean o 
(a) Peruvians from Pacasmayo ... | 17*°2+°13 (69) | 1°614:09 , 40°34°77 (67) | 9°304°54 
(6b) Peruvians from other localities | 17°2+°13 (102) | 1°98+°09 | 42°9+°64 (99) | 9°49+°45 
| 





(a) — (0) 0+°19 —2°6+ 1-00 





Rhinal Premaxillary 











(a) Peruvians from Pacasmayo 40°0+ °84 (16) | 4°999+°59 | 34°6+°26 (61) | 2°996+°158 
(6) Peruvians from other localities | 39°1+°61 (27) | 4°72+°43 | 34°2+°22 (82) | 2°98+°16 
(a) — (0) +°9+1°04 +°4+ 34 











lene 








be asked: Does such a deformation affect their facial measurements? This question 
ean be easily answered by making a comparison of the means for each character 
from two sets of crania, i.e. crania recorded as having a slight or marked artificial 
deformation in the frontal region and crania preserved in a natural state. Let us 
take the male Peruvian specimens at Oxford for illustration as the state of deforma- 
tion is individually recorded in the catalogue of the Museum there. The indices for 
the two groups just mentioned were calculated separately and the standard deviations 
based on the total male Peruvian crania were employed to obtain the probable 
errors. The constants are: 








Indices Frontal Simotic Rhiual Premaxillary 

















(a) Normal crania | 16°7°19 (43) | 43°9+1-00 (41) | 383% °84 (15) | 33-0+°36 (32) 
(b) Deformed crania | 17°0+°31 (16) | 41°241°66 (15) | 40°0+1°33 (6) | 34°1+°56 (13) 
-—1°7+1°58 -—1:1+°66 











(a) — (0b) —‘3+°36 +2°7 +194 





It will be seen that the ratio of the difference to its probable error is in no case 
greater than 1°7. This indicates that our facial measurements are not affected to 
a significant degree by the artificial deformation practised by the Peruvians as far 
as can be ascertained from the short series available. 


Ill. North American Indian. R.CS. 38 f', 262; Cambridge 8. This series 
represents a miscellaneous collection of North American Indians derived chiefly 
from Vancouver Island and the West. The different tribes are represented by such 
small numbers that there is no alternative to pooling all the available material ; 
it gives means which are of small permanent value. Some specimens show a 
certain degree of frontal deformation, but their facial skeletons were presumably 
not distorted to any appreciable extent. 














— 
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IV. Patagonian. R.C.S. 21g. These specimens came from various districts in 
the neighbourhood of the South-East coast of Patagonia, North of Tierra del Fuego. 
They represent several tribes of rative Indians. 


V. Fuegian. R.C.S. 12 ¢. These came from the South-West coast of Tierra 
del Fuego. 


D. OcEANIC SERIES. 


I. Maori: New Zealand. R.C.S. 44%, 182; Cambridge 107,52; Oxford 29 /, 
62; BM.6¢,1. Of these, 30 males and 9 females came from the North Island, 
principally from Auckland, and a few came from the South Island, but there are 60 
specimens, male and female, the original localities of which are not specified. 


II. Moriort. R.C.S. 33 f, 219; Cambridge 37,19; Oxford 64,42; B.M.1/, 
1¢. These specimens came from various parts of the Chatham Islands and it may 
be assumed that they form a racially homogeneous sample. 


Ill. New British. Cambridge 39g, 242. The specimens were collected by 
Mr A. Willey in the Island of New Britain. 


IV. Islanders of the Marquesas. R.C.S. 22 f,16. The series came from five 
main Islands, viz. Nukahiva, Fatuhiva, Hivaoa, Uahuka and Hiau. . 


V. Easter Islanders. R.C.S. 35 ¢,16$; Leiden 11¢,5$. A detailed description 
of this material is given in Dr von Bonin’s paper: Bm. Vol. xxul. pp. 249—270, 
1931. 


VI. Kanaka: Sandwich Islands, R.C.S. 52 ¢, 542; Cambridge 15’, 62. 
“Kanakas” is the name applied by some anthropologists to the natives of the 
Polynesian Islands generally. In the native language of the Sandwich Islands 
“Kanaka” means “man.” Here it is restricted to the inhabitants of the Sandwich 
Islands alone. More than 85 per cent. of the crania of this series came from the 
Island of Oahu (Woahoo) and the remainder from the Island of Hawaii (Owyhee). 


VII. New Caledonian. R.C.S.18¥. Of these, 10 specimens came from the 
mainland of New Caledonia, 4 from the Island of Pines, a small island South-East 
of New Caledonia, and 4 from the Island of Lifu, the largest of the Loyalty Islands, 
near New Caledonia. 


VIII. Solomon Isianders. R.C.S. 21 ¥, Cambridge 6 ¥. Among these 3 specimens 
are known to have come from the Island of San Cristoval, 1 from Guadalcanao 
and 1 from Isabel. The others are unspecified and only stated as being natives of 
the Solomon Islands 


IX. Jslanders of New Hebrides. R.C.S. 36%. Of these, 24 crania came from 
various Islands of the New Hebrides ranging from the Cherry (Anuda) Island in 
the North to the island of Tanna in the South. These include Eromanga, Ambrym, 
Faté, Api and the Banks Islands. The remaining 12 crania came from the Island 
of Mallicollo, and these are all remarkable for the depression of the frontal region. 
It has not yet been ascertained whether this is a natural conformation, or whether 

14—2 
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it is due to artificial deformation in infancy. The evidence derived from the 
Peruvian series leads us to hope that the facial measurments of any deformed 
crania in the present series would not have been affected appreciably. 


X. Fijian. R.C.S. 30/, 13 2. Out of these, 18 males and 9 females came 
from various burial-grounds on the Island of Viti Levu, the largest island of the 
Fiji group. They are mostly of the coastal tribes. Six males and 4 females came 
from the Island of Ovalau and 6 males from the Island of Vanua-Valavo. 


XI. Papuan. Cambridge 79 ¥, 49 $. The specimens were collected by the 
Daniels’ Expedition, and they came from more than 10 different districts in the 
territory of British New Guinea. 


XII. Islanders of the Cook Group. R.C.S. 19 ~. Of these, 9 came from the 
Island of Atiu (20° 0’ S., 158° 5’ W.), 4 from the Island of Mangaia and the remainder 
from neighbouring small islands. 


XIII. Tasmanian. R.CS. 21%, 182. These specimens were collected by 


various persons in Tasmania. 


XIV—XVI. Australian. It was found convenient to group the native 
Australian skulls measured in the following way: 


(a) Australians from the Northern Territory. R.C.S. 10 /, Oxford 7 #, B.M.1¢. 
These skulls came chiefly from Port Essington, Port Darwin, the region of 
Van Diemen Gulf and the banks of the Victoria River, so all are from the North-West 
of the Northern Territory. 


(6) Australians from Queensland. R.C.S. 25, 10 2; Cambridge 2¢, 1 2; 
Oxford 97, 1 ¢. More than half these specimens came from North Queensland 
and the remainder from South, East and Central Queensland. 


(c) Western Australians. R.C.S.8 ¥,3 2; Cambridge6¥,6 2; BM.3,2 . 
These skulls came chiefly from the North-Western district of Western Australia. 


(d) Australians from New South Wales. R.C.S. 12 ¥,8 2; Cambridge 2 ¥',2 2; 
Oxford 177, 8 $. These specimens came mainly from Sydney, Port Stephens, 
Bathurst, and the neighbourhood of the Hunter, Macleay and Clarence Rivers, 
i.e. from regions close to the Eastern coast of New South Wales. 


(e) South Australians. R.C.S. 35 /, 26 2; Cambridge 11 7,5 2; Oxford 5 ¥, 
49; BM. 1, 22. Most of these came from Adelaide and the banks of the 
Murray River, near the South coast of South Australia. 


(f) Australians from Victoria. R.C.S. 18 ¥,13 2; Cambridge 2 ¥; Oxford 4 ¥, 
1 2. The majority of these crania came from Melbourne, Geelong, Port Phillip and 
Port Fairy, on the South coast of Victoria. A few came from Ballarat and the 
vicinity of the Murray River (Victoria). 


We require to know whether there are any significant differences between the 
characters of the six Australian series. In order to examine this point the means 
of the male series for the four indices were calculated separately and they are given 
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below. The differences between all possible pairs of means together with their pro- 
bable errors are presented in the table below. The standard deviations of the South 
Australian series were used in computing the probable errors of the means for the 
Northern Territory, Western Australian, New South Wales and Victoria groups. 
The differences which exceed 2°5 times their probable errors are in italics. Considering 
the differences between the means in terms of their corresponding probable errors, 
which are shown in the second part of the table below, it will at once be noticed 
that the mean indices of the last four Australian series show no significant differences 
from one another, but the first two series, i.e. Australians from the Northern 
Territory and from Queensland, differ quite significantly in one or two characters 
from one another and also ‘ron the last four series mentioned, the single exception 
being that the Northern Territory series shows no significant differences from the 
South Australian group. Accordingly, the whole of the Australian specimens 
measured can be best regrouped in three series, viz. Australians from the Northern 
Territory (XIV), from Queensland (XV), and from all other parts of the continent 
(XVI). It may be noticed that the frontal indices show no significant differences. 
The general characters of Australian crania—according to Morant’s recent study 















































| Constants Series Frontal Simotic Rhinal Premaxillary 
© taney 188439 (18) | 4194150 (16) | 37-24 -93 (12) | 41-54-60 (17) 
(6) "ones 18°5 +-28 (36) | 37°5+1-01 (36) | 30°14 -67 (18) | 39-54-39 (35) 
‘tions (c) Western Australians | 18°9+°41 (16) | 43°8+1°45 (17) | 33-4+1-08 (9) | 38°9+°75 (11) 
re (@) Australians from 8*5 + 30 ( 3:9+1- 3°74 “81 ( 94: 24K 
New South Wales 18°5+-30 (30) | 43°24+1-09 (30) | 33°7+ -81 (16) 39-2 + °45 (30) 
| (e) South Australians 18°24 °23 (52) | 45°94 -90 (44) | 35°04 °*78(17) | 40°6+°39 (41) 
(f) Australians from | | 1.443423) | 43-341°25 (23) | 33-341°14 (8) | 39°84-54 (21) 
Victoria J 
Standand | ©) Ansteations Som } 2-53+°21 898+ “71 8+ “47 3-414 °28 
Dietintiaual Queensland % a ara ni 
. ” | (e) South Australians 2°45+°16 887+ °62 478+ °55 3°68 + °27 
(a) — (6) 4+°3+°48 +4°4+1°80 +7141°19 +2°04°72 
(a) —(c) —"14°57 — 1°94 2°08 +3°S$ 1°42 +2°6$°96 
(a) —(d) +°3+°49 — 1°3+1°85 +3541°23 + 2°34°76 
(a)—(e) +6 4°45 ~4°0+1°75 42°24 1°22 + ‘94°72 
(a)-(f) +°4$°52 ~ 1441-95 43°94 1°47 +1:7+°81 
(b) —(e) —*4+°50 —~6-341°77 — 3341-26 + 64°85 
a (b) - (d) 0+°41 —5°T+1°49 — 3641-04 + ‘34°60 
of Mesa (b) - (e) +:34°38 | — 8441-35 ~ 4941-08 ~11¥°55 
Se (b)-(f) $°1$°45 ~5°S#1°61 —3°2+1°32 — 34°67 
(ce) — (a) +4451 + 6+1°82 — *B+1°35 — 34°87 
(e) ~(e) +°7+°47 —2141°71 —16+$1°33 -1-7+°84 
(e)-(f) +°5 454 + 5+191 + *1#1°57 — 9+-92 
(d) —(e) +°3+°38 —2:7+1-41 —13+1-12 — 1-44 °60 
(d)—(f) +°1+-46 — ‘141-66 + “441-40 - 6+°71 
(e)-(f) —"2+°41 +2°641°54 +1°741°39 + 8+°67 
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(Bm. Vol. xtx. pp. 417—440, 1927)—suggest a division into two groups, viz. 
Northern Australians, on the one hand, and all the remainder, on the other. This 
accords with the present grouping based on measurements of facial flattening only, 
except that we have also to divide off a Queensland group. The pooled means of 
each character for our three Australian series are given in Table I (p. 222). 


E. Astatic SERIES. 


I. Chinese. Two series of Chinese crania were measured in the museums of 
Holland and England : 


(a) Chinese (Leiden). Leiden 48 ¥, 4 $. Of these, 15 males are marked as 
having come from the provinces of Kwantung and Fukien, South China, 14 males 
and 2 females from Hong Kong and Batavia (Java), and the remaining crania are 
unspecified. There is some reason to believe that these unspecified specimens most 
probably belong to the Southern Chinese and that no Northern elements are 
involved. 

(b) Chinese (English museums). R.C.S. 67 #, 12 $; Cambridge 6 /; Oxford 
13¢,22; BM.3¢. Nearly half of the specimens measured are marked as having 
come from Canton, Macao, Amoy, Hong Kong and Batavia (Java), and these almost 
certainly represent the Southern Chinese. Others are marked as criminals, seamen, 
or merely inscribed as “Chinese,” and no further information is available. These 
unspecified crania, we believe, may be the heads of Southern Chinese as the spellings 
of their names given on the skulls are most likely Cantonese or Fukienese. The 
means and variations of the four indices for both male Chinese series are given in 
the following table. 





Frontal Simotic 





Indices " is a l 


Mean | Mean o 


q 


























(a) Chinese (Leiden) 15°9+ °22 (48) | 2°294+°16 | 32°84 °77 (48) | 7°92+°55 
(b) Chinese (England) | 15°8+-°16 (88) | 2°15+°11 | 31°14 °86 (89) | 11°97+°61 | 
(a) — (b) +1 ++28 417TH 115 | 
| 
| 
Rhinal Premaxillary 
= |- : ¢ |. ae | u | 
(a) Chinese (Leiden) | 31°9+ °52 (43) | 5°08+ °37 | 34°8 + °32 (44) 3°144+°23 | 

(b) Chinese (England) | 31°2+°43 (66) | 5°19+°31 | 33-94-28 (77) 357 4°19 
(a) —(b) | +'°7+°68 — +°9+°42 — | 

{ 











Again we find that for all the characters with which we are dealing the two 
series differ insignificantly from one another, although these samples were collected 
in various parts of South China and its neighbouring islands. This accords with the 
belief that the Chinese race is homogeneous in its physical type. 
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II. Burmese. Two series of Burmese skulls were measured. 


(a) Burmese A Type. Biom. Lab. 44,7, 39 $. A southern series from the 
neighbourhood of Moulmein was divided into three groups of which the Burmese A, 
according to Miss Tildesley’s study (Bm. Vol. x11. pp. 176—262, 1921), is supposed 


to be true Burman. 

(b) Burmese in general. R.C.S. 32 /; Oxford 10°. The specimens in these 
museums are only marked “from Burma.” Their localities are mostly unspecified 
with the exception of 7 cases marked “from Rangoon or Pegu.” All skulls known 
to represent the primitive native races of Burma were omitted. 


The differences of the mean indices between the two male series are tested in 
the usual way. 





Frontal Simotic 





Mean o Mean c 


| 
| 


| 
Indices | 
| 











(a) Burmese A ... | 15°8+°28 (42) | 2°724+-20 | 32°94 -97 (41) | 9-24+-69 
(6) Burmese in general | 16°2+°16 (42) | 1°574°12 | 33°34 °97 (41) | 9°2524°69 
| (a) —(b) | —°4£°33 = | —"4+1°38 [ - = 
| | 
| 
Rhinal | Premaxillary 
Sa _ | oe 
Aare a 
(a) Burmese A AS | 28°7 + 52 (21) | 3°53+ °37 | 32°7 +°35 (34) | 3°00+°25 
(6) Burmese in general | 29°8+ *43 (36 | 3°804+°30 | 33°64+°32 (36) | 2°85+-23 | 
(a) — (6) —1:1+°67 — 


| —~-9+4-°47 | = 
| | | 
| | | 





It is seen that so far as the indices of facial flattening are concerned the two 
series do not differ significantly from one another. It is, therefore, justifiable to 
pool them for the present comparative purposes. 


III. Japanese. R.C.S.9/, Cambridge 1 ¥, Oxford 47, B.M.3 ¥. The series isa 
short one as there are only a few Japanese specimens in English museums. Only three 
crania are marked as having come from Tokio, Yokohama and Kyoto respectively, 
the others being from unknown parts of Japan. They are all of modern date. 

IV. Aino. RCS. 77, BM. 1g. Of these, 5 came from the island of 
Yezo (Hokkaido), the homeland of the Aino, 2 came from the Kurile Islands and 
1 from an unknown locality. 


V. Tibetan A. R.C.S.17 , B.M. 20 ¢. These specimens came from the South- 


West of the country and are classified by Morant as of the Tibetan A Type. See 
Bm. Vol. X1v. pp. 1983—260, 1923, and Vol. XVI. pp. 1—105, 1924, 





VI. Tibetan B. R.CS.15 ~. This series represents soldiers from the Eastern 
Province of Khams who are said to belong to the Tibetan B Type. See Morant, 
loc. cit. 1923. 
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VII. Nepalese. B.M. 48 ¥*. These crania came from different parts of Nepal. 
See Bm. Vol. Xv1. pp. 1—105, 1924. 


VIII. Malay. R.C.S.10 ¥. These specimens were collected in the Malay States. 


IX. Andamanese. R.C.S. 17 f, 16 $; Cambridge 14,7, 99. These specimens 
came mainly from the Great Andaman Islands although several crania are unspecified. 
South Andaman is better represented than the other islands. 


X. Nicobarese. Cambridge 14. These specimens were collected by J. M. 
Wright in 1923—24. 


XI. Naga. Biom. Lab, 23 ¥, 15 %. These specimens were on loan to the 
Biometric Laboratory at the time when they were measured (see Bm. Vol. xxv. 
pp. 1—20, 1933). They represent natives of the Naga Hills in Eastern Assam. 


XII. Javanese. The Javanese skulls measured may be divided into the four 
groups below. 


(a) Javanese from Bantam and Batavia. Leiden 55,1292. This series came 
from the provinces of Bantam and Batavia in the Western part of the island. See 
Bm. Vol. xxi. pp. 52—113, 1931. 

(b) Javanese from Middle and East Java. Leiden 69%, 159. These skulls 
came principally from the Middle and East of the island although a few unspecified 
skulls are also included. See ibid. 

(c) Javanese from Madura. R.C.S. 10 /, Leiden 13 /. 


(d) Javanese in general. 


parts of Java. 


R.C.S. 30¢, 72. These skulls came from various 




















Constants Series Frontal Simotic Rhinal 
(a) Javanese (B.B.) 155 +°20 (54) | 33°4 + ‘92 (53) | 30°4 + °55 (46) | 35°3 +° 
Me: (b) " (M.E.) 15°99 +°17 (69) | 32°7 + °82 (67) | 30°4 + ‘37 (61) | 34°71 + 
ee (ec) » (Madura) | 16°0 +-31* (23) | 31°6 +1°44 (22) | 30°7 + -71 (21) | 35°0 +: 
(d) ,, (General) | 16°1 +-26* (30) | 334 41-24 (30) | 30°5 + +73 (20) | 33-9 +- 
Standard | (a) Javanese (B.B.) 2°18+°14 9°88+ °65 554+ °39 4°01+° 
Deviations | (b)  ,,  (M.E.) 2°08 + °12 10°134 °58 425+ +26 3°87+° 
(a) —(b) -"4 +°26 + *7 £1°23 0 + ‘66 +1°2 +° 
(a) —(c) —°5 +°37 +1°8 +1°71 -"3 + °90 + °3 +" 
Differences (a) —(d) —"6 +°33 O +1°54 -"l + ‘91 +14 +° 
of Means (b) —(e) —'1l +°35 +11 +1°66 -°3 + °80 - 9 +° 
. (b) -(d) —"2 +°31 — ‘7 +1°49 -—l + ‘81 + "2 +° 
(ce) —(d) —*l +°40 —1°8 +1°90 +°2 +1°01 +l°l +° 

1 























Premaxillary 








FX FERS | 
* he mean weighted value of the standard deviations of the first two series ( ¢ = 3 at a ) for 


each index was used in calculating the probable errors of the last two series. 


Ny + Ny 
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As the mean differences of the indices between all possible pairs of these series, 
which are shown in the table on p. 216, are too small to be considered significant, 
the four Javanese series have justifiably been pooled for the present purpose and 
the combined means are given in Table I. It has been shown by von Bonin that 
cranial series from different parts of Java are very similar to one another in type. 


XIII. Sumatran. R.CS. 8¢, Leiden 48¢. These crania came from various 
parts of the country. A large number of them are said to come from the residencies 
of Palembang and Lampong and from the West coast of Sumatra. Several from 
Nias and other neighbouring islands have been included. 


XIV. Natives of Celebes. R.C.S. 9 ¥', Leiden 43 ¥. This series is chiefly made 
up of the skulls of Macassars and Bugis. 


XV. Natives of Sarawak. R.C.S.9¥, Cambridge 46 f, 249. The specimens 
preserved in the two museums were all collected by Sir Charles Hose in Borneo 
in 1899. They represent various tribes belonging to different districts of Sarawak, 
chiefly those of the Murut group. They were mostly killed by Long Pata, Long 
Kiputs or other tribes in the Baram district, Sarawak. 


XVI. Dayak. R.CS. 13, 83, Leiden 427, 102. See Bm. Vol. xxii. 
pp. 52—113, 1931, for an account of the Leiden series. As all these skulls came 
from Borneo and are marked as Dayaks, the two series may justifiably be pooled. 


XVII. Tagal. Leiden 314,19 $. See ibid. The bulk of these crania came 
from Manila, Mindanao and Central Luzon, Philippine Islands. The Tagals are 
usually classified as one of the brown, non-negrito tribes of the islands. 


XVIII. Aéa. Leiden 334°, 14. See ibid. The Aétas (Philippine Islands) 
are found in the interior of Luzon Island and in the islands of Mindoro, Panay and 
Negros, and also in the North-Eastern part of Mindanao. The people are very short 
in stature and are usually classed as Negritoes. 


XIX. Singalese. R.C.S. 24 f', 10 2; Cambridge 1/, 22; Oxford 97, 19. 
The majority of these specimens came from Colombo, Ceylon, and a few from 
different parts of the Western and Central Provinces of Ceylon. The crania of peoples 
other than Singalese living in the island were excluded. 


XX. Veddah. RCS. 167, 102; Cambridge 2, 12; Oxford 3, 3 . 
These skulls came mainly from the South-Western part of the Eastern Province of 
Ceylon. 


XXI. Punjabi. The Indians living in the Province of Punjab are, according to 
their religions, chiefly Mohammedan and Hindus. It is supposed that the people 
of the two groups do not inter-marry. The specimens measured represent: (@) Moham- 
medans. R.C.S. 39, Cambridge 6. This series comprises all crania inscribed as 
Mohammedan or Moslem from the Punjab. (b) Hindus. R.CS. 42 ¢°, Cambridge 6 #. 
This series consists of various castes of Hindus among which the low castes are 
better represented than the others. From the figures shown below it will be seen 
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that no significant difference is found between any corresponding pair of indices for 
these two series. They have therefore been pooled. 

















| 
| 
| Frontal Simotic 
Series — | ———$— 
| Mean o Mean o 
Qa ee 
“a 
(a) Mohammedans | 21°2+°'28 (45) | 2°794°20 | 49°641-28 (45) | 12°72+-90 
(b) Hindus | 20-94-25 (48) | 2°554-18 | 49-241-26 (47) | 12°794-89 | 
(a) — (6) +°3+°37 om | +°4+1°79 — 
| | 
Rhinal Premaxillary 
aS tar ah tee a ioe ea = Sas Sl Ea Be TS a ah Nags 2 
} 
| (#) Mohammedans | 43°54 ‘77 (23) | 5-454 °54 | 36-64-37 (39 | 3°41+-°26 
| (db) Hindus 43°6+1°44 (20) | 9°554+1°02 | 35°74+°39(40) | 3°64+°28 
(a) —(6) | — 11°63 - 
| 


= | 4-94-54 





XXII. Hindu: Madras Presidency. R.C.S. 49 ¢. These skulls came from 
the following localities: (a@) 30 crania from Berhampur in the North-East of the 
Presidency; (6) 6 from the Bellary district, in the Western part of the Presidency; 
(c) 6 from a burial-ground near Madras, and (d) 7 from the Salem district in the 
South of the Presidency. All these specimens have been pooled for the purpose 
of the present paper. 


XXIII. Hindu: North-East India. There are three more Hindu series which 
were measured in different museums. These came from the Provinces of Bengal, Bihar 
and Orissa, and the United Provinces, all in the North-East of India, so they may 
be conveniently dealt with together. The origin of these series is described as 
follows: 

(a) Bengalese. R.C.S. 88 J, 38 2; Cambridge 8 ¥,9 $; Oxford 19 ¥, 5 &. 
The majority of these specimens represent various castes of Hindus living in 
different districts of Bengal. Several crania of Mohammedans which came from 
the same province are also included. 

(6) Hindus from Bihar and Orissa. R.C.S. 37 ¢. These skulls represent various 
castes of Hindus living in the two provinces mentioned. The specimens came 
largely from the Patna district in the North-West of Bihar. 

(c) Hindus from the United Provinces. R.C.S. 26 #; Cambridge 1 ¥. These 
specimens represent several castes of Hindus who inhabited the United Provinces. 
The low castes of Hindus are better represented than the high. 

The significance of the differences between the mean indices for these three 
series and those from the Punjab and Madras (Nos. XXI and XXII above) is 


tested as usual and the constants are given in the Table on p. 220. The differences 
which exceed 2°5 times their probable errors are in italics. 
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From the figures shown in the lower part of the table (p. 220) it may be observed 
that the Punjabi (X XI) and the Southern Hindus (XXI1) differ significantly from 
one another in a slight degree in the case of the first two indices; also, both these 
series show slightly significant differences, in one or two characters, from all the 
three series from North-East India, with the single exception that no significant 
difference is found between the series from Bihar and Orissa and that from Madras 
Presidency (XXII and XXIIIb). The last two represent adjoining regions. The 
largest difference, which is 43 times its probable error, is found for the rhinal 
index between the Punjabi (XXI) and the Bengalese (XXIII a), which represent 
regions as far apart as any pair. In the case of the last three Hindu series 
(XXIII a—c) the differences of the means for all possible pairs of indices are 
quite insignificant. This shows that the degree of “facial flatness” of these 
samples from an enormous Indian population is a constant feature as far as can 
be told from the slender evidence available, and such a result agrees with that 
of a previous study which was based on the usual cranial measurements of several 
short Hindu series: see Bm. Vol. xx®. pp. 294—300, 1928. In view of the com- 
parisons made above, the five East Indian series measured may be justifiably 
reduced to three groups, viz. the skulls from the Punjab, those from North-East 
India (XXIII a—c) and those from Madras Presidency. The grouping seems to 
be a reasonable one which accords with the geographical positions of the peoples 
compared. 


XXIV. Dravidian. R.C.S. 36%. These specimens belong to two short series: 
(a) 18 crania which came chiefly from the Madura district (Madras Presidency), 
and belong to the Maravar tribe, which is Dravidian; (6) 18 crania from the 
Tanjore district (Madras Presidency), and no detailed information relating to these 
is available, but the locality suggests that they are most probably of Dravidian 
origin. 


XXV. Afghan. RCS. 18 ~. These crania came from various parts of 

fchanistan. Six of them are recorded as belonging to the Yusafzai tribes in 
Afgl t Six of tl orded as belonging to the Yusafzai trib 
the Peshawar Valley of this country. 


XXVI. Arab. R.C.S. 10. This collection came from scattered parts of the 
country including Western (the coast of Midian), Southern (Hadramaut) and 
Eastern (Oman) Arabia. 


XXVIII. Natives of Syria and Palestine. R.C.S. 10 f from Syria, 11 ¥ from 
Palestine; Cambridge 4,~ from Syria. This series came from several cemeteries 
in various districts of the two countries. Owing to the small number of specimens 
measured the pooling of these crania from the two neighbouring countries has 
been considered desirable for the purpose of the present investigation only. 
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(4) Sexual Differences of Mean Values. The material dealt with consists of 
83 male and 48 female series, and in general the female series are considerably 
shorter than the corresponding male series. Table I gives their means for the 
eight absolute measurements and four indices together with the probable errors. 
The sexual differences of these constants may be considered first. The sex ratios 
(i.e. the male means divided by the female means) are given in Table II for the 
eight absolute measurements in all cases where the means for both sexes are 
based on 20 or more skulls. Values have been given previously for a number 
of the larger calvarial arcs and chords in the case of seven European and African 
series*, These show that the ratio may be peculiar to the particular character 
considered, but the range found for all of them only extended from 1-024 to 1066. 
It can be seen from Table IT that all the sex ratios for the internal biorbital 
breadth (JG W)—a calvarial chord—fall within the same range. Little significance 
can be attached to such a range, of course, since the extreme values may be 
affected appreciably by errors of random sampling, or the inclusion of any series 
for which the male and female means do not represent precisely the same racial 
type is likely to extend the extremes considerably. While showing a few ratios 
less than 1-024 or greater than 1:066, most of these for MOW, GB, Sub. JOW and 
Sub. GB lie between those values. As far as can be seen the sex ratios for these 
characters are not peculiar, and the subtenses to JOW and GB have approximately 
the same values as the chords. But for the remaining measurements the position 
is different. For the simotic chord (SC) there are seven ratios less than unity, the 
female mean being greater than the male, and 14 of the 27 ratios are less than 
1:024. The simotic subtense (SS), however, has only one value less than 1-024, 
but 24 greater than 1-066. For every series the ratio for the subtense is greater 
than that for the chord. The last character, the rhinal subtense (Sub. MO W), also 
has high sex ratios, the lowest of the 11 that can be given being 1078, while 
again the value for the subtense is invariably greater than the value for the chord 
(MO W) in the case of the same series. We see then that the simotic chord shows 
peculiarly small sex ratios while the simotic and rhinal subtenses are also dis- 
tinguished, but on account of their peculiarly large ratios. The interracial averages 
shown in Table II bear out these conclusions. The breadth of the nasal bones 
shows relatively small differences between man and woman, but there are relatively 
large sexual differences between measurements of the projection of the nasal bones, 
both at the bridge of the nose and at their lower extremity. It may be anticipated 
that large sexual differences will be found between mean simotic and rhinal indices 
for the same series, and possibly no markedly significant sexual differences between 
the mean frontal and premaxillary indices. The fact that these relations do hold 
can be appreciated most readily from Figs. 1 and 2. The first figure shows the 
interracial correlation between the simotie and rhinal indices, all available means 
based on 15 or more skulls being shown, and it can be seen at a glance that in the 
case of both these indices every male mean is greater than every corresponding 
female value. This is a particularly striking relation since several of the series are 


* See Biometrika, Vol, xx. pp. 214 and 275, 1931, 
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“Flatness” of the Facial Skeleton in Man 


TABLE II. 


Sea Ratios of the Absolute Measurements (all means used based on Twenty or more Skulls). 









































Subtenses Chords 
Continental] Serial Series* — 
Area 0. Sub Sub Sub 
Su ub. ub. ; S 
IOW SS Mow! GB IOW Sc | MOW GB 
A. III | Anglo-Saxon - 1:058 | 1-200 1°044 | 1044 | 1°046| — | 1°059 
European Vv English : Farringdon St. | 1-057 | 1:150 | 1-117 | 1-065 | 1-042 | 1-011 | 1-039 | 1-065 
Series XII Italian “a 1°055 | 1°154 | 1°158 | 1°058 | 1:064 | 1°032 | 1°057 | 1°0 
VI English : Spitalfields 1°047 | 1°047) = — |1:049/1:010; — — 
Il Romano-British ... 1-034 | 1°095| — | 1°045} 1-026] 1°033 |} — | 1°059 
Average ratios 1°050 | 1°129 | 1°137 | 1°053 | 1°047 | 1°026 | 1°048 | 1°065 
B. IV_ | Egyptian: Sedment 1°115 | 1°026 | 1°124 | 1:065 | 1:061 | °950 | 1-050 | 1-050 
African I Guanche ale 1°074 | 1:190; — | 1:070; 1°060| -990} — | 1:058 
Series V Egyptian: Kerma 1°074| 17188} - 1-024 | 1°042 | 1°051 | — | 1°046 
XII | Congo Negro 1°056 | 1°087 1°012 | 1°036 | 1-000 — 1-011 
VII Gaboon Negro _... .. | 1042 | 1°104 | 1-078 | 1°048 | 1°047 | 959 | 1°053 | 1-057 
X Negro: Teita Hills | 1°036 | 1°158 | 1-148 | 1°081 | 1°050 | +922 | 1°029 | 1045 
II Nubian | 1°029 | 1°313 - 1°018 | 1°026 | 1°114 -— 1°040 
IX | Ashanti | 1:024 | 1°381} — | 1-040 | 1-057 | 1100) — 1°077 
ee (Sa eS ae ee % S orl ans Brae SE Rs Tne See Se tad 
Average ratios . | 1°056 | 1°181 | 1°117 1°039 | 1°047 | 1°011 | 1°044 | 1°048 
Nerte M <a ph ee, Nor © Chess SEs, CP 2 
C. I] | Peruvian 1°07 | 1°188 | 1-118 | 1°27 | 1-042 | 1°022 | 1-078 | 1°057 | 
American) I | Eskimo . | 1-029 | 1-263} — | 1-016] 1-051] 1-056 | — | 1-078 
Series It | North American Indian. 1-018 | 1°125 | — | 1°025 | 1°046 | 1°024) — | 1°060 
Xe Average ratios 1°041 | 1°192 | 1:118 | 1°023 | 1°046 | 1°034 | 1°078 1-065 
b eagd) 4 Spe aie. Oe es pores eu eo fee Pian Ae 
D. I Maori 1°104 | 1°240) —- | 1°026| 1°056| 1°029| - | 1:057 
Oceanic XI Papuan... 1°112 | 1°143 | 1-107 | 1°027 | 1°044 -988 | 1-053 | 1°056 
Series XVI | Australian: “Otherregions” 1°081 | 1°367 | 1°167 | 1°042 | 1°058 | 1-057 | 1-036 | 1°047 
VI Kanaka 1°067 | 1°286 | 1°144 | 1-024 | 1°053 | 1-028 | 1:066 | 1-070 
IT Moriori 1°018 | 1°077 1°060 | 1°049 | °855| -- | 1°070 
III New British 1°013 | 1:286 1°054 | 1°060 | 1:012} — | 1057 
Average ratios 1:049 | 1°233 1°139 | 1°039 | 1°053 | *955 | 1°052 | 1°059 
E. II 3urmese 1°098 | 1°200} — 1°077 | 1°055 | 1°047} — | 1°045 
Asiatic XII Javanese ... pte 1-077 | 1°167 | 1°102 | 1°030 | 1-040 | 1°000 | 1-020 | 1052 | 
Series XXIII | Hindu: N.E. India 1°072 | 1°171 | 1-207 | 1°050 | 1-039 | 1°024 | 1°071 | 1°025 
IX Andamanese 3 1°033 | 1°091 — | 1°031 | 1:028]1°022| — 1°057 
XV_. | Sarawak (Natives) 1032 |. 963} — | -— |1-049| 930] - 
| 
Average ratios con | 1°062 | 1°126 | 1°155 | 1:047 | 1°042 | 1°005 | 1°045 | 1°045 
SEC wer LSYy Se te as pe! Sed) Proeaes. 
Average of all Series 1°053 | 1°173 | 1134 | 1°050 | 1°048 | 1°012 | 1°040 | 1°055 
(27) | (27) | (1) | (25) | (27) | (27) | Gd) | (25) 










































































* The series for each continental area are arranged in order according to the size of the sex ratios 
for the first character. 
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small, and it may be confidently asserted that for the same series no one of the 
usual cranial indices for which sexual differences are found would have shown a 
like uniformity. The fact that marked sexual differences are found for the simotie 
index was first noted by de Mérejkowsky, and it was confirmed by Ryley, Bell and 
Pearson. Table III gives the absolute differences of the mean indices and these 
quantities expressed in terms of their probable errors. It will be seen that most 
of the sexual differences of the simotic and rhinal indices are significant, and many 
are markedly significant. The degree of significance tends to be greater for the 
simotic than for the rhinal index, but it must be remembered that the means for 
the former are based on considerably larger numbers than the means for the latter 
in the case of most of the series. For these two characters there are evidently 
marked sexual differences for all modern races. Fig. 2 and Table III enable us to 
examine the frontal and premaxillary indices in the same way. For the former 
there is really no suggestion of sexual differentiation. For 15 series the male 
index is the greater, for 11 the position is reversed and there is absolute equality 
in one case. The most significant sexual difference is only 3:09 times its probable 
error. The premaxillary index is a measurement of shape, which indicates less 
similarity between the sexes. There are six series having the male mean greater, 
18 having the female mean greater, and equality is shown for one series. No 
differences are markedly significant, but the three which exceed three times their 
probable errors all have the female mean greater. It is quite likely that the female 
index would be found invariably greater than the male in the case of large samples, 
but sexual differentiation here is obviously much less than that observed between 
the shapes of the nasal bones. 


We may conclude that the male nasal bones are, on the average, decidedly 
more protruding than the female, while the premaxillary region tends to be less 
protruding in the male. The last relation will clearly not help in aiding the sexing 
of individual crania, and the rhinal index is of little value for the same purpose 
since the tips of the nasal bones are so often found to be defective. It may be 
doubted, too, whether the simotic index is of much value in aiding sexual dis- 
crimination owing to the fact that the intraracial variability of the character 
is very large. The difference between the means for the two sexes (Table IIT) is 
in all cases but one (Australians: “Other” regions) less than both male and female 
standard deviations for the same series (Table V), and in most cases very decidedly 
less. In these circumstances the simotic index can be of little value by itself in 
aiding “mathematical sexing,” though the form of the nasal bridge is a character 


which may profitably be considered in conjunction with many others in anatomical 
sexing. 


These data may be considered from another point of view. Do they suggest 
that sexual differences are greater for “advanced” or for “primitive” races? More 
and larger samples would obviously be needed to give any decisive answer to this 
question. The sex ratios for the simotic subtense and the sexual differences for the 
simotic index need only be considered in this connection, and it may be seen from 
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the constants in Tables II (p. 226) and III (p. 229) that the races of Asia and 
Europe show a slight tendency to be less differentiated sexually than those of the 
other regions, while the Oceanic races tend to show the greatest sexual differences. 


(5) Seaual and Racial Differences in Variability. Table IV gives the standard 
deviations and coefficients of variation for all the racial distributions of absolute 
measurements made up by 50 or more individuals: Table V gives the standard 
deviations of the indices for all the distributions made up by 30 or more individuals. 
The sexual differences in variability may be considered first. Few comparisons can 
be made in the case of the absolute measurements, but the data suggest that 
female variation for the characters considered shows a slight tendency to be greater 
than male variation. Combining all the measurements, there are 18 cases in which 
the male coefficient of variation exceeds the female and 26 in which the position 
is reversed. The most significant difference is only 34 times its probable error, and 
among the four cases showing this ratio greater than 3-0 the female index is the 
greater in three. It is curious that the position appears to be different when the 
indices are considered. In 44 comparisons the male standard deviation is the greater 
and for the remaining 14 the female constant is greater than the male. In six cases 
the sexual difference in variability exceeds 3:0 times its probable error, the greatest 
ratio being 64, and for all these the male constant is the greater. The dominance 
of male variation for the indices may also be appreciated by comparing the weighted 
mean standard deviations given in Table V. Out of all possible sexual comparisons 
between these which can be made for uche four indices and five continental areas, 
there are 16 cases in which the male constant is in excess of the female and only 
tour for which the position is reversed. We must conclude that for these measure- 
ments of shape male variation shows a distinct tendency to be greater than female, 
which for the measurements of size, judging by coefficients of variation, there is 
either sexual equality in variability, or females show a slight tendency to be more 
variable than males of the same race. The material is obviously too meagre to 
investigate whether there are any differences in the preponderance of variability 
of one sex over the other between different continental areas. Racial comparisons 
of the constants of variability may be considered next and it will be sufficient if 
we confine our attention to the male indices in this connection. The series in 
Table V are arranged in order of their standard deviations for the frontal index 
in the case of each continental area. The range of values for each area is quite 
wide and there is no difficulty in selecting pairs belonging to the same group 
which differ quite significantly. This may not indicate in every case that the racial 
populations from which the samples were drawn really differ in variability for this 
particular character, as several of the samples may not have been randomly chosen 
owing to reasons of which we are ignorant. If the distributions of the standard 
deviations for the five continental areas are compared it will be seen that they are 
all overlapping and the inter-group means—excluding the American which can 
only be based on three series—are almost identical. The same is found in the case 
of the premaxillary index, and so few series are available for the rhinal index that 
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TABLE V. Standard Deviations of the Indices of Facial Flattening*. 





















































Continental | Serial Frontal Simotic Rhinal Premaxillar 
Area No. Race Bex Index Index Index Index : 
A. VI | English: Spitalfields ... | ¢ | 1°704+-05 (248) | 11°794 °50 (126) | 5°154°35 (50) | 3°084-17 (73) 
Europe IV English: Mediaeval eee | gy | L°874°14 (43) | 12°79+1°02 (36) 3°67+°30 (35) 
XX Russian... es | 95 | 2°03+°16 (35) | 11°72+4 93 (36) _— 2°66+°22 (33) 
Ilt Anglo-Saxon ae ees | 99 | 2°084°10 (92)| 9°294 °51 (77) — 3°65+°20 (76) 
XI French aie See es | 99 | 2114°15 (46)| 8°98+ °65 (43) 4:114°31 (39) 
Il Romano-British wee | 59 | 2°184°11 (86) | 12°51+4 -66 (82) | 5°56+°48 (30) | 2°964°18 (64) 
V English: Farringdon St.... | ,, | 2°19+°12 (75) | 12°75+ °68 (81) | 6°16+°48 (37) | 3°674°24 (53) 
XVIII | Modern Greek _... .-» | 95 | 2°264°19 (33) | 10°02+ °85 (32) — _ 
VIII | Modern Scottish ... ... | ,, | 2°304-19 (35)| 9°404 -79 (32) = 3°14+°27 (31) 
x North German .... eee | yy | 2°314°20 (32) | 11°9041°04 (30) —- 3°65+°31 (31) 
XII Italian oe ies cee | 99) | 2°334°11 (100) | 11°40+ °54 (100) | 5°78+°38 (52) | 3°-444°17 (94) 
XIV | Swedish... on ess | 99 | 2°364°21 (30) = — 
VII Modera Irish we eee | 99 | 2°374°14 (65)]11°704 °69 (65) | 5°35+°39 (44) | 2°654°17 (57) 
IX British es > eee | gy | 2°454°11 (113) | 137134 +59 (111) | 6054-31 (84) | 3°124°15 (101 
XVI | Dutch 2... | 5p | 29494217 (51) | T1154 74 (51) ae 3B2Q+ D4 (45) 
Weigited Mean (c) » | 2°84 [15] | 11°64 [14] | 5°77 [6] | 3°32 [13] 
III Anglo-Saxon ‘ -- | 9 |1°59+°11 (45)| 9°32+ °69 (41) _ 2°824+°21 (40) 
VI English : Spitalfields » | 1°764°09 (94)| 11°63+4 °97 (31) — -- 
IT Romano-British woe | gp | 2°144°15 (45) | 10°734+ “79 (42) — 3°56+°29 (39) 
V English: Farringdon St.... | ,, | 2°154°12 (75)|10°19+ °56 (76) | 5°27+°37 (47) | 3°99+4°27 (51) 
XII | Italian... : we | yy | 2644-22 (32) | 10°67+ -91 (31) 2 
Weighted Mean (c) ost A Pare [5] | 10°42 [5] | 5°27 [1] | 3°53 [3] 
a for Se See Se Rarer RE | ES eee Let a. 
B. IX Ashanti rr ese e- | & | 1°794°15 (33) | 10°66+ °87 (34) 3°15+°27 (30) 
Africa VI Negro : Nigeria... =... | 4, | 1°824°12 (52)| 9°664 66 (49) 3°604°25 (46) 
Ill Egyptian: Badari eee | 95 | 2°094°18 (32)| 707+ °58 (34) = 3°37 +°28 (34) 
VII Gaboon Negro » | 2°18+°12 (76)| 8°70+ °49 (71) | 4°18+°28 (49) | 3°27+°20 (63) 
XII Congo Negro wee | 59 | 2°214°14 (60)] 8°23+ °51 (60) -- 4°264°31 (42) 
xX Teita Negro a ee | gy) | 2°244°14 (55)| 79694 °59 (39) 2°714°23 (31) 
IV | Egyptian: Sedment =... | ,, | 2334-18 (40)| 10-014 -79 (37) = 3°384°27 (37) 
i 4 Egyptian: Kerma woe |g) | 2°41 4°11 (117) | 9°19 °44 (99) —- 3°62 +°17 (103) 
XV_ | Kaftir ves ase wee | gy |: 2°434°16 (54) | 11°944 78 (54) 3°634°24 (51) 
II Nubian... 0. ss | gp | 2°49$°14 (69)| 8094 “52 (55) 3°624+°27 (40) 
Weighted Mean (@) ase: | ig eee [10] | 9°31 [10] | 4°18 [1] | 3°52 [10] 
XI Congo Negro Hee oe | 2 | 1°874°16 (80) — —- 
II Nubian ee, » |190+°12 (61)| 615+ -48 (38) _ 2564-21 (35) 
IV Egyptian: Sedment » | 2°00+°17 (30) — - 
VII Gaboon Negro » | 2°01+°12 (60)| 842+ “5d (55) | 3°96 + °29 (44) | 3°244°22 (51) 
IX Ashanti , » | 2°08+°18 (31) | 10°96+ °95 (30) 
x Teita Negro a » | 2°09+°13 (62)| 716+ -49 (49) | 4°26+°37 (31) | 3°694°29 (37) 
Vv Egyptian: Kerma 9 | 2°19+°11 (94)| 9°494 °53 (74) 3°00+°16 (83) | 
Weighted Mean (¢) a) » 1 [7]| 8°58 [5] | 4°09 [2] | 3°12 [4] 


























* The numbers in round brackets denote the numbers of skulls on which the standard deviations are based and none was 
calculated for series made up by fewer than 30 specimens. The weighted mean @’s for the different continental areas were found 
24 NoGo? +... ‘ . 
from the formula ¢= a/' = 4 rate ‘4 , and the numbers in square brackets denote the number of series on which they are 
1tMgt... 
based. The series for a particular continental area are arranged in order of the standard deviations of the frontal indices. 
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TABLE V—(continued). 
Continental | Serial Frontal Simotic Bhinal Pre 
Area No. ace Sex Index Index Index Index 
C. Ill North American Indian ... | ¢ | 1°744°12 (45)|11°314 °80 (46) — 2°934+°22 (41) 
| America Il Peruvian » | 1°814°07 (171) | 951+ °35 (166) | 4°844°35 (43) | 3°00+-12 (143) 
I Eskimo » |1°964°13 (51)| 886+ °60 (50) — 3°234+°23 (45) 
| aia Ley | — “ 
Weighted Mean (#) 3 | 1°83 (3]| 9°73 [3] | 4°84 [1] | 3-03 [3] 
II | Peruvian Q | 2094-09 (114) | 7-364 +34 (107) | 5254-56 (20) | 2964-14 (97) 
| I Eskimo » | b72+°15 (30) — — — 
Weighted Mean (¢) > 128 (2]| 7°36 [1] | 5°25 [1] | 2-96 {] 
D. II | Moriori g |1°844°13 (43)| 11-894 -88 (42) — 2°88 +°22- (39) 
Oceania XI Papuan » | 1°84+°10 (76) | 10°04+ °54 (78) | 5°064°35 (47) | 3°574°21 (65) 
III New British 5 | 1°85+°14 (39) | 10°54+ °81 (39) | 3°-464°38 (33) | 2°404°19 (35) 
V Easter Island - | ,, | 1°934°10 (41) | 12°24+ *88 (44) — 3°374°27 (36) 
x Fijian 5, | 1°98+°17 (30) | 10°85+ +95 (30) = ne 
I | Maori vs nee ase | gg | OLE “12 (89) | 12°20H -62 (87) | 5-114-40 (38) | 3464-19 (76) 
VI Kanaka: Sandwich Islands | ,, 2°09+°12 (67)|11°81+ °71 (63) | 5°394°38 (47) | 2°80+°18 (54) 
IX New Hebridean .- | 4, | 2°234°18 (36) | 10°13+ °83 (34) _ 3°744+°31 (33) 
XV Australian: Queensland... | ,, | 2°534°20 (36)| 898+ ‘71 (36) -- 3°414°28 (35) 
XVI Australian: “Other” regions | ,, | 2°994+°13 (121) | 10°04+ °45 (114) | 4°624+°31 (50) | 3°60+°17 (103) 
Weighted Mean (c) roan ee | 2°26 [10] | 11°22 [10] | 4°83 [5] | 3°34 [9] 
XI Papuan ate .» | Q | 1°864°13 (49) | 845+ 58 (49) | 3°954°32 (34) | 4:034+°30 (40) 
VI Kanaka: Sandwich Islands | ,, | 2°05+°13 (60) | 11°05+ “68 (60) | 3°814°30 (37) | 2°98+°19 (57 
XVI Australian: “Other” regions | ,, | 1°944+°10 (80)| 8°264 “46 (74) | 4°224+°39 (27) | 3°414°19 (71) 
Weighted Mean (&) Bo Oe [3]| 9°31 [3] | 3°97 [3] | 3°43 [3] 
| E. XIII | Sumatran g | 1:584°10 (56)| 10-144 -G5 (56) | 5°87+°39 (52) | 3614-24 (50) 
| Asia XVIIT | Aéta » | 1°644°14 (38) - = ae 
XVI1 | Tagal 4, | 1°65+°14 (31) — -— — 
IX Andamanese | 1°83+°16 (31) — -- — 
XVI Dayak one | . | 2°00+°13 (55)| 985+ °67 (50) | 4°464°32 (44) | 3°324°23 (46) 
XV Sarawak (Natives) yy | 2°044°13 (55) | 10°05+ °65 (55) _ 4°56+°33 (43) 
V | Tibetan A ... ” | 2:064°16 (37)| 833+ °67 (35)| 5714-47 (34) | 3-214-26 (36) 
XXII | Hindu: Madras 5, | 2°06+°14 (49) | 10°444 *73 (46) —- 3°444+°24 (48) 
XII | Javanese ” | 2°134-08 (176) | 9°75+ °36 (172) | 4°82+°19 (148) | 3°834-14 (166) 
XXIV | Dravidian ... | ., |2°154+°17 (36) |] 10°464 °*72 (35) — 3°334+°29 (31) 
I Chinese... ... |” | 2-19-09 (136) | 10°79 -44 (137) | 5°18+4 +24 (109) | 3°51 4°15 (121) 
XIV_ | Celebes (Natives) ... ” 1219415 (52)| 7-444 +49 (52) | 4294-31 (45) | 3424-23 (49) 
II Burmese 5» | 2°244°12 (84)| 8954 °47 (82) | 3°814°24 (57) | 2°994°17 (70) 
X1X Singalese ... ae » | 22418 (34) 110374 85 (34) —_ — 
XXIII | Hindu: N.E. India "| 2354-08 (179) | 11-17H +41 (173) | 5°O24-27 (78) | 3-44-4713 (163) 
Vil Nepalese 5 | 2°564°18 (46) | 14°484+1°01 (47) | 5°794°44 (39) | 2974°21 (44) 
XXI_ | Punjabi ” |9-874°14 (93)| 1294+ -64 (92) | 7644-56 (43) | 3674-20 (79) 
Comin tate et froks ss Ree 
Weighted Mean (@) » | 2°20 [17] | 10°60 [14] | 5-20 [10] | 3°54 [13] 
XII | Javanese ve ase | Q | UTHAA (84) | 78+ “G4 (34) | 4724741 (30) | 3684-32 (30) 
II Jurmese... vt we | gy | L999#'15 (38) | 849+ “67 (37) — 2°75+°24 (30) 
XXIII | Hindu: N.E. India » | 207414 (50) | 10-464 “72 (48) — 2°96+°21 (46) 
Weighted Mean (a) | 1°96 [3]| 9:13 [3] | 4°72 [1] | 3°13 [3] 
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no conclusion can be drawn. The position is almost the same for the simotic index 
except that the group mean for the African races is appreciably lower than the 
others. It may be seen from Fig. 1 that the lowest simotic indices found are for 
African races. We can only conclude that the variabilities of the indices of facial 
flattening for different races are not peculiar to continental areas and that there is 
no evidence that the variations of advanced and primitive peoples are differentiated 
in this respect. 


(6) Intra- and Interracial Correlations of the Measurements of Facial 
Flattening. All four of the indices considered in the present paper are of the same 
type, as each is formed by expressing a “median” subtense as a percentage of 

a corresponding transverse breadth. It might be anticipated that for both intra- 
and interracial samples they would be quite highly correlated with one another, 
and this could be supposed an expression of the fact that they might all be pre- 
sumed to measure the same feature of the cranium (viz. the transverse flattening 
of the facial skeleton) in rather different ways. For an intraracial sample positive 
correlations are to be expected between all pairs of the absolute measurements, 
owing to the effect of a common size factor*, and it would not be surprising to 
find that some pairs of the breadths and some pairs of the subtenses show high 
positive correlations. The correlations actually found do not coincide with all our 
anticipations in the present case. Table VI gives 72 male and female intraraciui 
correlation coefficients for the longest series available. They range from —*176 to 
+ ‘434 and there are only 29 values (all positive) which can be considered to differ 
significantly from zero. The frontal index is seen to be quite uncorrelated with 
the simotic, no significant correlation being found for any series, while the closest 
association is that between the simotic and rhinal indices which both measure the 
prominence of the nasal bones, all the values being significant in this case. Both 
frontal and premaxillary indices show a more marked, but still low, degree of 
association with the rhinal than with the simotic index. We must conclude that 
the last, which gives a measure of the prominence of the root of the nasal bones, 
is practically uncorrelated in the individual with other measurements of facial 
flattening which do not involve the nasal bones. The same might be true for an 
index confined to the lower extremities of these bones, the rhinal not being of that 
type since the breadth from which it is calculated is between points on the malar 
and maxillary bones. But the correlations between the frontal and premaxillary 
indices are also of a low order and the supposition that all our indices were 
measuring the same feature of the facial skeleton in rather different ways is seen 
to be entirely belied by the facts. The intraracia) correlations of the absolute 
measurements will clearly be of interest and these are given in Table VII for the 
male Javanese series—the longest available—for which the indicial correlations in 
Table VI are quite typical. As might have been anticipated, all the correlations 


* See the standard paper on cranial correlations by Professor Karl Pearson and Miss Adelaide 
G. Davin: ‘On the Biometric Constants of the Human Skull,” Biometrika, Vol. xvi. (1924), pp. 
328—363., 
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Intraracial Correlations of the Indices of Facial Flattening*. 








TF,S TF,R 'F,P Ts,R Ts,P TR, P 
Race Sex | (Frontal and | (Frontal and | (Frontal and | (Simotic and| (Simotic and | (Rhinal and 
Simotic) Rhinal) Premaxillary) Rhinal) Premaxillary) | Premaxillary) 
| 
oy | 
British (Soldiers) ... ae a +°083 + °064 | +°2994-067 | +°2424°063 | +°248+°069 | +°1994°065 | +°2434-073 
i (111) (84) | (101) (84) (98) (76) 
Ttalian . | 4 | —°089+-067 | +°279+°086 | +°2034°066 | +°3934-080 | +°237+°066 | +°288+ 087 
(101) (52) (96) (51) (95) (50) 
Chinese | | £°O83+°058 | +°3014°059 | +°1784°059 | +°3814°055 | +°2424-058 | +°3314-060 
(134) (108) | (121) (109) | (121) | (100) 
Hindu: N.E. India | 4, | #°O764°051 | +°4344°062 | 4+°2434-050 | +°37384°065 | +°1694°052 | +°3714-068 
Po a (78) | (163) (78) (158) (73) 
| Javanese » | + 064+ °052 | +°1534°054 | +°1694°051 | +°3994°047 | +°119+°052 | +°1274°056 
(165) (149) (166) } (143) (163) (142) 
Egyptian: Kerma » | +°002 +069 — +°074 + -067 | — +°086+°071 — 
| (96) (99) (88) 
Gaboon Negro » | #1154078 | —-069+-096 | +°113+°083 | +°2724-089 | +°093+°086 | +°147+-°102 
(72) (49) (64) (49) (60) (42) 
Peruvian » | +°1874°051 | +°141+4°101 | +°253+4°053 | +°413+°085 | +°0204+°056 | +°189+-°106 
(166) (43) (143) (43) (143) (38) 
Maori » | #°1734°070 | +°0354°111 | —°052+°077 | +°351+°096 | +°182+-075 | —-0384°114 
(88) (37) (76) (38) (75) (35) 
Australian: “Other”regions | ,, | —-038+-064 | +°047+-096 | +°082+4-066 | +°282+ -088 | —-024+-067 | +-400+-086 
L | | (May. | (49) (102) (50) | (101) (43) 
| | 
Egyptian: Kerma 9 | —°1764°-077 | | +°141+4°'074 | —-061+-081 | = 
oo .. e | (80) | (69) | 
Gaboon Negro » | —°1444°091 | +°089+°101 | +°007 + °095 1+ 426 +°082 | +°082+°099 | +°1014+°107 
53) (44 (50) (44) (46) (39) 
Peruvian | » | +096 + -065 | 5 | +°156 + -063 = | —-109+-070 | aa 
ae eee (96) (92) 
Australian: “Other” regions | ,, | —°035+-079 | | +-093+ -080 +°142 + -080 | em 
) (70) (69) 


























* The coefficients in italics are the only ones which differ from zero by more than three times their probable errors. 


in Table VII are positive. It has been shown by Pearson and Davin that nearly 
all the intraracial values found between pairs of absolute measurements are 
positive owing to the influence of a common size factor. This could be supposed 
to account for a number of correlations found for the Egyptian series with which 
they were dealing of the order +°1 to +°4+, but when the measurements compared 
also partly “covered” one another higher values were generally found}. Among 


the 28 coefficients in Table VII there are 10 less than 


<9 


= 


and most of these do not 


+ E.g. the coefficients found for the male Egyptian E series are +°3971+-0191 for L and B, 
+3344 + -0203 for B and H, +-2758+ -0229 for G’H and J, and +°2358+ -0218 for NB and L. 
t E.g. the coefficients found for the male Egyptian E series are + °6341+-0137 for B and U, 
+7059 4: 0117 for NH and G’H, and +-+4170+-0199 for B’ and J. 
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differ significantly from zero, while there are only two greater than 5*. The 

different subtenses show no high correlations with one another and the simotic 

chord is lowly correlated with the other facial breadths. We are dealing, in fact, 
with characters which are but little associated intraracially, and the conception 

of there being a single factor of facial features which can be measured in the 

individual in slightly different ways by the different subtenses or indices considered 

is seen to be of no practical utility. 


Since the indices we are considering were devised for the purpose of dis- 
criminating racial types, the interracial correlations are of more importance than 
the intraracial values dealt with above. Unfortunately the material collected is 
quite insufficient to give reliable estimates of the former. Four of the scatter 
diagrams given by the mean values are in Figs. 1—4, and the constants are in 
Tables VIII—X. In calculating the last, all means available based on 15 or more 
skulls were used in order to obtain a sufficient number of series, and it must be 
admitted that the sampling errors of the less reliable means are probably large 
enough to distort the resulting interracial constants. Owing to this fact, the 
standard deviations in Table VIII are probably larger than they should be. Never- 
theless, those for the frontal and premaxillary indices are less than any intraracial 
standard deviations found (cf. Table V), and for the simotic and rhinal indices the 
interracial values are less than nearly all the intraracial. For all these indices 
interracial variability tends to be decidedly less than intraracial variability, and 
the same is true for the absolute measurements (cf. Tables IV and IX) and for 
nearly all of the more usual cranial measurements. The interracial correlations in 
Table VIII are probably somewhat too low, owing to the fact that several unreliable 
means were included in computing them. They are surprisingly different from the 
intraracial values in Table VI. All of the former class are decidedly higher than 
the corresponding coefficients of the latter class, except in the case of the pre- 
maxillary with the simotic and rhinal indices respectively for which the interracial 
and intraracial coefficients are not clearly differentiated. This demonstrates again 
the impossibility of deriving any knowledge of interracial correlation from a 
knowledge of intraracial constants only. The coefficients for absolute measure- 
ments in Tables VII and X can be used to demonstrate the same point. While all 
the values in Table VII are positive, there are several quite high negative values in 
Table X, and it is of interest to note that these are nearly all with the maxillary 
facial breadth (GB). There is a clear indication that in the type the frontal and 
nasal subtenses and the simotic breadth decrease with an increase in GB. 


* The relative values of the correlation coefficients will depend partly on the racial homogeneity 
of the series. The Egyptian E series is known to be more homogeneous than most available series. 
Comparisons with the Javanese can only be made in the case of three male coefficients of variation, the 
values being: 


GB Ss sc 
Egyptian E 4-90-08 (877)  23°1041-64 (50) —-16°844 1-17 (50) 
Javanese 4674-17 85°65 + 1-45 22-754 +87 


GB for Egyptian E is given by Pearson and Davin in Biometrika, Vol. xv. 1924, pp. 328—363, and SS 
and SC by Ryley, Bell and Pearson in the same Journal for 1913, Vol. 1x. pp. 391—445, 
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(7) Racial Comparisons of the Measurements of Facial Flattening. A description 
of the material measured is given in Section (3) above, and the mean measurements 
from which racial comparisons can be made are in Table I. Enough series are avail- 
able, representing all the principal groups of modern races, to give a definite 
estimate of the value of the new criteria for anthropological purposes. Figs. 1—4 
give scatter distributions of the mean indices taken in pairs, and no detailed 
descriptions of, or comments on, them need be made. But it is necessary to supple- 
ment the diagrams by discussing the order of differences which may be considered 
to indicate statistical differentiation in the case of each index. 


Fig. 1 shows the interracial correlation of the rhinal and simotic indices. All 
series for which both means are based on 15 or more skulls are shown, but the 
majority of them are made up by considerably more than 15 individuals, the actual 
uumbers being given in Table I. In order to obtain an estimate of differences 
which are statistically significant, it will be sufficient to consider the 11 male 
European series which are all in the top right-hand corner of the diagram. Taking 
the rhinal index first, it is found that out of the 55 possible differences between 
the means, 31 are less than 2°5 times their probable errors, 9 show values of the 
ratio of the difference to its probable error between 2°5 and 40, and for the 
remaining 15 the ratios are greater than 40. If we suppose that a difference is 
significant when it exceeds 2°5 times its probable error, then by the rhinal index 
the Dutch series is differentiated from all the others except the Russian; the 
Spitalfields and Mediaeval English from all except one another and the Farringdon 
Street English; while the Italian and Farringdon Street English series are also 
differentiated. Six of the series—viz. the Russian, Anglo-Saxon, Scottish, Irish, 
Romano-British and that of British Soldiers—cannot be distinguished from one 
another by their rhinal indices, and neither the Italian series, on the one hand, nor 
the Farringdon Street English, on the other, can be distinguished from the same 
group. Turning to the simotic index, it is found that 36 of the 55 differences are 
less than 2°5 times their probable errors ; 9 show ratios between 2°5 and 4-0, and 
the remaining 10 show ratics greater than 40. Taking the limit again at 2°5, 
the Italian series is differentiated from all others except the Russian, the 
Spitalfields and the Mediaeval English; the Spitalfields is differentiated from 
all others except the Italian, Russian and Mediaeval English; and the Mediaeval 
English is differentiated from all others except the Italian, Russian, Spitalfields 
and Farringdon Street English and the Romano-British series. Eight of the 
11 series—viz. the Dutch, Russian, Anglo-Saxon, Scottish, Irish, Romano-British, 
Farringdon Street English and that of the British Soldiers—thus show no significant 
differences between their mean simotic indices. If larger numbers of individuals 
were available, it is quite probable, of course, that some of the mean simotic indices 
for these racial types would be differentiated. For the material available, however, 
it is quite evident that statistically significant differences are found between points 
which are separated, in the case of either variate, by distances which are a small 
fraction of the total range shown by the same variate. It will be quite safe to 
assume, without further examination, that all the European series differ most 
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significantly from all the Oriental and African negro series in the case of both rhinal 
and simotic indices. 


The frontal and premaxillary indices, used in Fig. 2, may be treated in the 
same way, and it will be sufficient to restrict attention to the group of five male 
Indian series at the top of the diagram. For these, oniy three of the possible ten 
comparisons of mean frontal indices show differences in excess of 2°5 times their 
probable errors, and the greatest ratio found is 3°0. The Singalese series shows 
a difference from that of Hindus from Madras which may be considered just 
significant, and the same is true for the Punjabi series from those of Hindus from 
North-East India and Madras respectively. More significant differences are found 
between the mean pre-maxillary indices, only three of the ratios being less than 
2°5, five being between 2°5 and 4:0, and two being greater than 4°0. Both the means 
for Singalese and Hindus from North-East India are differentiated from all for the 
other three series. All the mean frontal indices of the Indian series will clearly be 
differentiated most markedly from ail the means for the Oriental series, and quite 
markedly from nearly all the European means. The premaxillary index is less able 
to differentiate these three groups of races, but significant differences for it will be 
shown between means which are separated by quite a small fraction—less than 
one-tenth, say—of the range shown by all races in the world. 


The fact that the last statement is true for all four indices indicates—since 
many of the series with which we are dealing are very small—that the indices are 
well-defined racial characters and their uses for purposes of classification may now 
be considered. The characters of this kind with which anthropologists are familiar 
may be divided somewhat arbitrarily into the following classes: 


(a) Characters which appear to be of little use for any purposes of racial 
classification, although they may differ quite markedly from race to race. 


(b) Characters which serve to differentiate and provide suggestive arrangements 
of racial types belonging to any single family of races, but which are practically 
valueless for the purpose of classifying the different families of races. Such are, 
notably, the cephalic index and stature, the ranges for these two being almost as 
great for European races, for example, as for all races in the world. 


(c) Characters which are practically constant for races belonging to any single 
family of races, but which provide suggestive orders when the different families are 
compared. Such are, notably, skin colour, the nasal index and the nasal angle 
or other measurements: of prognathism. Characters of this kind alone are 
capable of approximately grading all races in the world in order of their 


“primitiveness *.” 
Fig. 1 suggests forcibly that both the rhinal and simotic indices belong to the 
last of these classes. The interracial correlation between them for male means was 
* See comments on this point by G. M. Morant in Annals of Eugenics, Vol. 11. (1927), pp. 334—336, 


where it is concluded that the nasal index and nasal angle are the best fitted of the usual cranial 
characters to serve the purpose mentioned. 
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found to be +°868, and it is clear that the orders in which the two considered 
singly will arrange the races are very similar. African negro types have the most 
depressed nasal bones, then the Oriental races, which are remarkably similar to one 
another in this respect, follow and lead on to the American, Oceanic and Egyptian 
groups which are not distinguished from one another; the three Indian series all 
have more prominent nasal bones, and finally come the European types with the 
most prominent to be found in the world as far as our evidence can tell. This 
character of nasal prominence evidently tends to be fairly constant for races 
belonging to the same family—such as the European, Oriental or African negro 
groups—while it makes some very clear inter-group di.‘inctions. Anyone un- 
accustomed to dealing with quantitative anthropological characters in this way 
may fail to appreciate the significance of the diagram we are considering. The 
vast majority of the measurements of size or shape usually provided fail entirely 
to supply any suggestive arrangements, when considered singly or in pairs, in the 
case of a sample drawn from all parts of the world. The cephalic index or stature, 
for example, will give an apparently meaningless jumble for such a sample of racial 
types. But the rhinal and simotic indices appear to be as capable of making inter- 
group distinctions as any other craniological characters known. The only other 
ones which can compare with them in this respect are the nasal index and the 
nasal angle or other measurements of prognathism. These last two are known to 
be highly correlated interracially with one another*, and it is probable that both 
are highly correlated with the rhinal and simotic indices. But the measurements 
of nasal prominence seem to give rather more suggestive orders than either the 
nasal index or angle+. The rhinal subtense was found to be rather an unsatisfactory 
measurement, since the tips of the nasal bones are so often broken, and it is some- 
times difficult to decide whether they are defective or not. Since the resulting index 
has now also been shown to be highly correlated with the simotic index we do not 
recommend its further use. 


The order in which the mean frontal indices arrange the racial types can be 
appreciated from Fig. 2. This is still a suggestive one, though it has evidently 
not the significance of those given by the rhinal and simotic indices. The Indian, 
European and Oriental groups are kept quite distinct and the African negroes 
occupy a restricted range between the last two. It is interesting to note that the 
Indian series all fall at one extreme of the distribution, while all belonging to the 
second Asiatic group—the Oriental—are at the other extreme. The primitive 
Oceanic and African negro series occupy intermediate positions and several of them 
are not differentiated by this character from European types. It is evident that 
the frontal index can give no indication whatever of “primitiveness.” The position 
with regard to the premaxillary index is similar to that found in the case of most 
anthropological characters. It has been shown above that a small fraction of the 
total range may be taken to indicate a significant difference, so the index is as 
capable of differentiating races as most measurements. But it fails entirely to 





* Morant found a coefticient of +*747 between the nasal index and nasal angle, loc. cit. p. 336. 
+ Cf. our Fig. 1 with the tables of nasal indices and angles given by Morant, loc. cit. pp. 335 and 336. 
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arrange them in an order which can be supposed to aid classification. No two of 
the principal families of races are distinguished and the range for African negroes 
is almost as great as for all races in the world. The premaxillary index is seen to 
be the least valuable of the four we are considering for purposes of inter-group 
comparisons and, like the others, it does not provide any interesting intra-group 
arrangements. It thus falls into class (a) defined above. What interest there is in 
the arrangement shown in Fig. 2 is given to it by the frontal index alone. 


Fig. 3. shows the interracial correlation of the two most suggestive indices— 
the frontal and simotic. The coefficient between them for male types was found to 
be + 699. Four of the principal families of races are seen to occupy discrete areas 
with somewhat of a jumble of Oceanic, American and Egyptian types falling 
between them. It may be doubted whether any other pair of craniological characters 
that have hitherto been investigated can effect a more inceresting arrangement 
than this. The compactness of the points representing the Oriental series and 
their wide separation from the Indian group are particularly interesting facts. It 
has been shown by coefficients of racial likeness that when all the more important 
of the usual craniometric characters are considered together no sharp line of 
division can be drawn between these two Asiatic families of races, since the Nepalese 
type closely resembles that of Hindus of Bengal, on the one hand, and Tibetans of 
the A type on the other, while the last has close affinities with several Oriental 
types*. The relationships suggested by Fig. 3 alone are entirely different from 
these; the Tibetans A are as widely separated from the Indians as any other 
Oriental series, and, though the Nepalese point does diverge slightly in the direc- 
tion of the Indians, it might well be considered a true member of the Oriental 
group. The frontal and simotic indices thus provide a means of distinguishing the 
two Asiatic families of races in the clearest possible manner, although a considerable 
number of other measurements fail to make a sharp division between them. This 
example warns us not to attempt to base any scheme of racial classification on the 
evidence of the two indices alone. They clearly provide valuable indications for the 
purpose in view—perhaps as valuable as those which can be given by any pair of 
characters known—but racial relationships can only be estimated by taking into 
account the evidence provided by a considerable number of characters which may 
not be highly correlated interracially with one another. It would be a gross mistake 
to assume that the New British are closely allied to any Oriental people merely 
because its point falls well inside the restricted area covered by all the Oriental 
races in Fig. 3, though mistakes of that kind are not uncommonly made. 


Fig. 4 shows the interracial correlation of the premaxillary and simotic 
indices and it is obviously of far less interest than Fig. 3. The fact that the 
African Negro, Oriental and European groups are separated is seen to be due 
almost entirely to the separation made between them by the simotic indices. The 
premaxillary index may be supposed to give a measure of sub-nasal prognathism 


* See T. L. Woo and G. M. Morant: ‘‘A Preliminary Classification of Asiatic Races Based on 
Cranial Measurements.” Biometrika, Vol. xxiv. (1932), pp. 108—134. 
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THE INTERRACIAL CORRELATION OF PREMAXILLARY AND SIMOTIC INDICES. 
All meas used are based on 15 or more skulls. 
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and the location of the Tasmanian and Australian series at the extreme end of the 
range would hence have been anticipated, but it is surprising to find that both 
Teita Negroes and Bushmen have lower means than all the European series available. 
This index does not appear to be of much value for purposes of racial classification 
and hence its further use cannot be advocated. 

Finally, we may consider racial differences of the absolute measurements from 
which the four indices are derived. The rhinal subtense and chord may be omitted, 
since they can be given for considerably fewer series than can the other measure- 
ments, and the premaxillary subtense is not of special interest. The breadth (GB) 
associated with that subtense is included, however, since it is a measurement which 
has been given for far more series than the new ones taken for the purpose of this 
paper. The ranges for different groups of races given in Table XI are found from 
the data given in Table I. The frontal subtense (Sub. JO W), for the data available, is 
seen to make an absolute distinction—i.e. to show ranges which do not overlap— 
between the African Negro and Oriental, African Negro and Indian, Oriental and 
European, Oriental and Indian, and Oceanic and Indian groups, while precisely 
the same distinctions and no others are made by the mean frontal indices (see 
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Fig. 2). But the frontal breadth (IO W) is only able to distinguish the African 
Negro and Indian groups. The simotic subtense makes absolute distinctions 
between the African Negro and European, African Negro and Indian, Oriental and 
European, Oriental and Indian, and Oceanic and European groups, while the mean 
simotic indices make all these distinctions and also one between the African Negro 
and Oceanic groups as well (see Fig. 3). But the simotic chord is not able to 
distinguish any of the groups. These rough comparisons suggest forcibly that the 
" frontal and simotic indices are valuable racial criteria by virtue of differences 
between the subtenses, but not between the chords involved. The case is different 
for the premaxillary characters, which were not valuable racial criteria, since the 
index makes no distinctions between any of the groups (see Fig. 2), but the 
breadth (GB) differentiates the Oriental group from both the European and Indian 
(see Table X1). 





TABLE XL. 


Ranges of Male Mean Absolute Measurements (all based on Fifteen or more Crania 
for different Groups of Races*. 





Racial Groups Sub. OW SS IOW SC GB 














African Negrot | 17°1—18°6 (8) | 2°2—2°9 (8) | 98°6--103-0 (8) | 8'5— 9°5 (8) | 93°9— 98-5 (7) 
Orientalt — ... | 15°1—16*5 (14) | 2-4—3-4 (14) | 94°3— 99°5 (14) | 7-~6— 9-6 (14) | 97-4—101°5 (12) 
Oceanic ... | 16°0O—18°8 (16) | 2°7—4°1 (16) | 97°3—101°8 (16) | 6°5— 9°3 (16) | 89°8—102°6 (15) 
European... | 17°7—19°3 (19) | 4-5—4-9 (18) | 96-6—100-0 (19) | 8°6—10-3 (18) | 91-7— 97-2 (18) 
Indian § ... | 18°9--20°2 (7) | 3°7—5°0 (7) 




















93°2— 97°6 (7) | 85— 9-8 (7) | 93-0— 96-9 (6) 





* The American series were omitted since very few are available, as are some others, such as the 
Egyptian series, which cannot be placed in any of the groups included. The numbers in brackets are 
the numbers of series from which the extremes were deduced, 

+ Excluding the Bushman series. + Excluding the Andamanese but including the Celebes series, 

§ Including the Afghan series. 


8. Conclusions. The transverse “flatness” of the facial skeleton has long been 
recognised to be a valuable criterion for the purpose of aiding the discrimination 
and classification of the races of modern man. This research was undertaken with 
the object of examining it by quantitative methods on an adequate number of crania. 
Four pairs of absolute measurements were taken, whenever possible, on each specimen 
giving four indices which express a “median” subtense as a percentage of a corre- 
sponding transverse breadth. One of these indices—the simotic—had been used 
previously and the others are termed the frontal, rhinal and premaxillary indices, 
All the subtenses were found with a pair of co-ordinate calipers made for the purpose. 
The material measured (preserved in various English and one Dutch museum) 
consists of 4266 male snd 1630 female specimens divided into 83 series which 
represent all the principal groups of races of modern man. An analysis of the 
statistical constants leads t» the following conclusions: 


vr 





(i) Five of the eight absolute measurements show sex ratios of the usual order, 
but for the simotic chord the ratios are small and for the simotic and rhina! sub- 
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tenses they are particularly large. The simotic and rhinal indices hence show marked 
sexual differences between means, while the premaxillary index shows slight sexual 
differentiation and the frontal none at all. The nasal bones are, on the average, 
more protruding in the male than in the female of the same race, but the pre- 
maxillary region tends to be less protruding in the male. 


(ii) In the case of the indices male variation shows a distinct tendency to be 
greater than female, but for absolute measurements—judging by coefficients of 
variation—female variation is either greater than male, or else there is sexual 
equality in variability. 

(iii) There is no evidence that the variabilities of the characters examined 
differentiate “primitive” and “advanced” races. 


(iv) The intraracial correlations of the absolute measurements and indices are 
decidedly lower than would have been anticipated. 


(v) The interracial variabilities tend to be decidedly smaller than the intra- 
racial variabilities. 


(vi) The interracial correlations of the indices are clearly differentiated from, and 
they tend to be decidedly higher than, the corresponding intraracial correlations. 
The correlations of the two classes in the case of the absolute measurements show 
many marked differences. 


(vii) All four indices show significant differences between the majority of the 
means that are available, so they are well-marked racial characters. 


(viii) While showing no marked differences between races belonging to the same 
family, or group, of races, the rhinal and simotic indices—which are highly correlated 
interracially—appear as capable of making inter-group distinctions as any other 
craniological characters that have been examined (see Fig. 1). 


(ix) The frontal index also shows small intra-group differences, and it makes 
several clear inter-group distinctions, though these are not so well-defined as those 
given by the rhinal and simotic indices (see Fig. 2). 


(x) The premaxillary index fails to make distinctions between the different 
groups of races and it does not provide any intra-group arrangements to which 
significance can be attached (see Fig. 2). 


(xi) There would be little advantage in determining the rhinal and premaxillary 
indices for larger numbers of skulls, but the simotic and frontal indices are valuable 
characters which should be recorded in the routine descriptions of racial series of 
crania. The arrangement provided by the last two (see Fig. 3) is as interesting as 
that given by any other pair of cranial measurements. Some distinctions are made 
by the simotic and frontal indices which are not effected by any other cranial 
characters that have been investigated. 


(xii) It is shown that the simotic and frontal indices are valuable racial criteria 
by virtue of racial differences between the subtenses—but not between the chords— 
from which they are formed. 














MISCELLANEA. 


(i) On certain Measures of Dependence between Statistical Variables. 
By J. F. STEFFENSEN, Copenhagen. 


1. Ina lecture delivered at “Institut Henri-Poincaré” and published in the Annales of 
that Institute (1933, pp. 319—331) I have suggested a modification of Karl Pearson’s Mean 
Square Contingency. The object of the modification was to present the measure in such a form 
that it remains applicable also when the number of distinct values which the: atistical variables 
can assume becomes infinite. It has been pointed out to me recently by Mrs Pollaczek that 
the measure in the form I had proposed does not always, as I had somewhat incautiously 
asserted, assume the value 1 in the case of complete dependence between the variables. A slight 
modification suffices, however, to meet this objection, and the measure is thus reduced to its 
original form which I communicated to Professor Cramér at the beginning of 1930 without 
publishing it. In the present Note I intend to establish the measure from first principles in its 
correct form, and thereafter to propose a new measure which presents certain advantages over 
the other, 





2. With a view to comparisons with the lecture, it is preferable to retain the same notation, 
and as this differs somewhat from notations employed by other authors, I shall begin by explaining 
the symbols preferred. 


Let there be a compound event depending on two statistical variables x and y of which must 
assume one of the distinct values 
Oh oe eee 


and y one of the distinct values Yry Yo Ys» +> 


“9 

each with a certain non-vanishing probability, the sum of the probabilities being unity for each 
sequence of values, The number of values may be finite or not, but it is assumed to exceed one 
in each sequence, as otherwise w or y would not be a statistical variable. 


The probability that 2 assumes the value x; and y the value y; is denoted by pj. The 
probability that x assumes the value x; irrespective of what value y assumes is obtained by 
summing p;; with respect to all 7, A summation of this nature is denoted by replacing the letter 
with respect to which the summation is performed by an asterisk. We thus have 

Pix == py and similarly py; => piy- 
j i 


We also note Pax => Pj == Pix= = Py s=1. 
v i j 


It was assumed that p;, and py; cannot vanish; they are also smaller than unity, because 
there are more than one possible values for each of the variables. We have, therefore, 


0< pix <1, 0 < py; <1 Cee ereeceeceneveeeeseeetseeeceseeeteees (1), 
for all values of ¢ and j. 


The probability that y assumes the value y; when tt ts known that « assumes the value 2; is 
denoted by »4);. We thus have 
Pigs = Pix Paz oo oe eeecneenccccccccccccsvccccceeeesscsecssesees (2), 


and similarly Pag Digg Delp) '~ss es aseascicecsusdives Ceckucgecsucbeesteuceues (3). 
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The extreme case where x and y are completely independent of each other is characterised by 
the relation 
Pig= Pit Pai noeeneccceceeeeeeeccececccececeecccsescsenssees (4) 
valid for all ¢ and j. bec 
The other extreme case occurs if one of the statistical variables is a function of the other. In 
this case, each value of x is always associated with the same value of y, and we may therefore, 
without loss of generality, associate x; with y;. The condition of complete dependence may then 
be written 
Pi=Vig=Peis Pyg=O for CAF ......ccocccccccccrcccsccsscccces (5). 


3. After these preliminaries, we may proceed to the measure. It is denoted by yp and defined 
by the relation 





$i?= RES eerie teters (7). 


where 


We propose to show that y? possesses the following properties : 

(I) wis always comprised between 0 and 1. 

(II) wy’? vanishes if the variables are completely independent and only in that case. 
(III) y? assumes the value 1 in the case of complete dependence, and only in that case. 


As regards the property (I), we observe that y? is an arithmetical mean, with positive or zero 
coefficients, of the quantities ¢;;7. It therefore suffices to prove that ¢;;<1 for all values 
of ¢ and /j. 


Now, in the cases where pj; 2 pix Pj, We write ¢,;7 in the form 





oi2= Pig Pix Pai Pij— Pix Pxj (8) 
i Si eae. “haccecancnansusecrsgevenseeenasus : 
” Pie (1 — Pai)” Pai (1 Pix) 
As pij S Pix, we have Pii— Pie Pj S Pin (1 — Pai), 
so that the first factor in (8) cannot exceed unity. 

Further, as p;; S py;, we have 

Pi — Pix Pas S Pus (1 —Piee)s 

so that the second factor in (8) cannot exceed unity. We therefore have ¢,;? <1, if pi; 2 pix Px;- 

Considering, next, the cases where p;; = px Py; We write $;; in the form 

dyin Pie hed— PY — Pier et Pe csscussvenesavsnsetessees (9). 
: Pix Pas — (1— Pin) (1— Ps) 


Here, the first factor obviously cannot exceed unity. As regards the second factor, we begin 
by observing that 
Pix—Pij S1— Peis 
because the right-hand side is obtained by summation of the left-hand side with respect to 7. We 
have therefore 


Pix Pai — Pig 31 — Pine — Pj t+ Pie Pri 
or Pix Pai — Pig S (1 — Pix) (1 —Pas)s 
so that the second factor in (9) cannot exceed unity. 
We therefore have ¢$;;? <1 also if p;; = pix py; ; and hence, in all cases, 


Proceeding to the property (II), it is obvious that y* vanishes if pj;=p;4 p,; for all values of 
i and j, that is, if the variables are completely independent. 
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In order to prove the converse proposition, we assume that y? vanishes, and have to show that 
then p;;=Pix Px; for all values of i and 7. We begin by proving that if y?=0, no p,; can vanish*. 


Let us for a moment assume that, for instance, p,,=0; this assumption leads to a contradiction - 
if y?=0. We have, in fact, 
~ (Pij — Pix Pas) =, 


and as the term (Prs— Prx Pxs) 


is negative, because p,,;=0, there must be at least one positive term in the sum. Let, for instance, 
the term 
(Pam —Pnx Pem) 


be positive. In that case @,,,2 does not vanish, and y* contains therefore the positive term 
PrmPnn- Therefore y? >0; but we had assumed y?=0, and we have thus arrived at a contradiction. 


As no p,; can vanish, if y?=0, it follows from (6) that all ¢,;? vanish if ¥?=0, that is, we have 
Pi = Pix Px; for all values of ¢ and /j. 


Turning, finally, our attention to the property (III), we first have to prove that if the 
dependence is complete, then ¥?=1. Now, making use of (5), (6) reduces to 





(pi — Pis”)* 
> ii = w=. 
5 Pit os (1 — pia) Piz (1 — Pad) rt 


Conversely, let us assume that y?=1, that is 
> py Pi? =1. 
ii 
As 3 p=1, and as ¢;;7 < 1, this relation cannot hold, unless ¢;;?=1 for all values of 7 and 7 for 
vu 


which p;; does not vanish. Now, if p;;> 0, we cannot at the same time have pj < pig py; and 
,’=1, as the first factor in (9) is less than unity. We therefore have p;; > pix ~4;, and as each 
of the factors in (8) must equal unity, if $;?=1, we have pj=pig=py;, that is, y; occurs 
certainly, if 2; occurs. 


These considerations show that if Y°=1, then there is complete dependence between the 
variables. 


4. In order to approximate to y by means of a given experience, we replace the probabilities 
by the corresponding relative frequencies. Let H;; be the absolute frequency of the combination 
(xi, Y;)) and let us write 

sag 5, Hyj=% H;, N=Hex, 


so that V is the total number of observations. If ¥ denotes the approximation to y obtained in 
this way, we find by (6) and (7) 
1 Hj NH — Hig 145” 


y= — ——— id ais SAV ERE SOANER ESOS RERUN 
N u Hy. (N— Hix) H,; (N — H;) ( 











For the reasons explained in the lecture, no attempt has been made to estimate the mean 
error of this expression. 


5. The calculation of ¥ by (10), although not prohibitive, is more laborious than desirable, 
and I have therefore endeavoured to establish another measure which leads to simpler calcula- 
tions. The new measure which I shall denote by @ is defined as 





2 |Py—PixPsil (11), 


a> =F 


(Pi — Piz") +2 Pin Pas 


* I owe this observation to Mr N. P. Bertelsen with whom I have discussed various points in this 
paper. 
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where = denotes summation with respect to all 7 and j, summation with respect to all ¢ and 7 for 
which pj > Pix Pj, and = summation with respect to the remaining 7 and j, that is, the values 
for which p;; S Pix Px;- 

It can be proved that possesses the same three fundamental properties (I), (II) and (III) 


as wy. Before proceeding to the proof, it is useful to point out that may be written in various 
other forms. For the sake of argumentation it is preferable to write 


_ > (Pij— Pix Pai) +2 (Pin Pei — Pu) 
= (Dis — Piz”) + 2 Pie Pas 





@ 

Further, as 

= (Pi — Pie Pxi)=9, 

the aggregate of positive terms in this sum must cancel the aggregate of negative ones; hence 

(11) may be written Fe 

ox? — > Pii— Pix Pui) 
> (Dig — Piz?) +E Pia Pai 
If, in this expression, we eliminate = by the relation S=5+= and observe that = p;4 ~y;=1, 
We oviain 





___ 3 (Pig- Pie Pas) 
3 (Pij — Pix Pas) t1-Spi? 
The convergence of all the sums is obvious, as 

>| Ps — Pix Paxil < = Pig +2 Pie Pas=2; 
> (Pig — Py?) +2 Pin Pas <2 Pit Pix Pui =2, 


o=2 


and te 
= Piz? < =pPi= a. 
As regards the proof of the first property, or (I), we observe that pj; S$ pix, Pij SPyj, 80 that 


Pi? < Pix Pxj- It follows that each term in the nominator of (12) cannot exceed the corresponding 
term in the denominator, so that 0 <@ <1. 


The second property (II) is obvious, as (11) shows that @ vanishes if p,;;=p;, py; for all values 
of 7 and j, and only then. 


In order to prove the third property (III), we must first show that #=1 if there is complete 
dependence between x and y. Now in that case we have, according to (5), p,=0 if 7 + j, and 
the corresponding terms which belong to = in (12) become equal in nominator and denominator. 
If, on the other hand, i=j, we have pi;=piy=p,;, and the corresponding terms which belong to 
= in (12) become equal to (p;;—;2) both in nominator and denominator of (12). Hence o=1, if 
the dependence is complete. 


We finally have to prove that if #=1, then the dependence is complete. But, observing that 
all the terms in the sums in (12) are positive or zero, and that each term in the nominator cannot 
exceed the corresponding term in the denominator, it becomes clear that, if o=1, then each term 
in the nominator must be equal to the corresponding term in the denominator. Hence, we have 
either pyy=0 or else pi? =piy pyj; but on multiplying (2) and (3) with each other it is seen that 
we can only have p;,;7=p; py; if pyyj=1, pyj)=1, so that 

Pii=Pix=Pxi- 
The dependence is, therefore, complete. 

6. In order to approximate to » by means of a given experience, we replace, as in paragraph 4, 
the probabilities by the corresponding relative frequencies. The simplest formula to employ for 
the numerical work is (14). If @ denotes the approximation to obtained in this way, we find 

3 NH; -3 Hix Ay; 
DD SF ete teentecenecceensnenscee es 15 
SN Hy — 2Hiy Hy; + N*-3 H,;? “ (15), 


where = means summation with respect to all values of i and j for which VHy > Hy Hy;- 
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It is seen that the calculation of @ by (15) is considerably easier than the calculation of ¥ by 
(10). 

7. In order to apply the measures y and @ to continuous probabilities, we choose a constant 
interval / for x, and a corresponding interval & for y, and put 


GK+K fihth 
pu= ff. se gaedy, 


th+h [jk+-k 
pe=f? S@ *) dz, pu-fe FT (*,y) dy, 
ih Sd 


where the asterisk means integration over the whole range. Inserting these values in (6) and (11), 
it is seen that the values of w and » depend on the values chosen for 4 and s. This dependence 
is very pronounced in the case of yf, less so in the case of , so that @ is also in this respect 
preferable to w. 


An entirely satisfactory measure in the case of continuous probabilities has not yet been found, 
as far as I ain aware. The measures proposed by H. Cramér* and by Mrs Pollaczek+ are subject 
to inconveniences to which these authors themselves draw attention. 


(ii) Remarks on Professor Steffensen’s Measure of Contingency. 


EDITORIAL. 


Let us consider a bivariate contingency table in which m,; individuals are in the ¢j internal 
cell, and for which m;. individuals fall in the ith cell of one marginal total and m.; in the jth 
cell of the other marginal total. 


Let M= Si; (mj) = S; (m; -) = S; (m <a 
Consider the expression 


m;.m.;\* 
M5 (ma; - M ) 
sa = Si; ~ (i). 


ma Saba Sera as 
Mmj;.m.; (1 = 7) ( ae y) 


Then from (i) it is possible to show that, if 





(a) mj=m;.m.;/M for all values of i and j, then the two variates are independent, and 
yW?=0, and conversely if y2=0, the variates are independent ; 

(b) mij=m;.=m.;, the two variates are such that if a value of one occurs in the ith 
marginal cell, the corresponding value of the second will always fall into the jth marginal cell, 
and the value of y? will be 1, and conversely if the value of ¥* be unity, then the above relation 
holds ; 

(c) the value of y? always lies between 0 and 1. 

Now let us write pjj=m,;/M and pj.=m;./M and p.;=m.,/M, then (i) becomes 
=o Din wae 
= §,, —Pu (Pij— Pi-P 5) aah ben Saakeavobaarou sen eyeenuebeed (ii). 
v " p;.p.; (1—pi-) (l—p-y) 
(ii) is clearly in form identical with Professor Steffensen’s y* value. 
We may define p;; as the chance of drawing an individual out of the yth cell and p;. and 


p.; a8 the chances of drawing individuals from ith and jth marginal cells, when an individual 
must be returned, before a second drawing, to the cell from which it has been extracted. 


* Skandinavisk Aktuarietidskrift, Vol. vu. (1924), p. 231. 
+ Zeitschrift fiir angewandte Mathematik, Vol. 13 (1933), p. 122. 
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Now there seems in this method of approaching the subject no restriction on the number of 
cells or on the smallness or largeness of the p’s, or it seems allowable to pass to differentials. 


Suppose then we have a frequency surface of x, y variates, where 
a= Mf, (x, y), 
and for the distributions of x and y alone, i.e. marginal total curves, 
%=Mfr (x), 2%=Mfs(y). 
We can take m= Mf, (x, y) dx dy, 


m;.= Mf, (x) dx, m.;=Mfs(y) dy, 
and accordingly 
yas, 2a hile D(file N-f(2) fy)? 
7 fa (#) fs (y) dedy (1 — fy (x) dex) (1 — fs (y) dy) 
—[ fae2 dye SAG D(A N-Le(*)fa(M"| +i 
=| favs dy { Aw frly) } in the limit. 


The expression in curled brackets will be finite and accordingly the integral, owing to the 
presence of dx?dy? instead of dxdy, will vanish, however close or loose the bond between x 
and y. 








The mean square contingency as I define it is given by 


m;.m.;\? 
Mj = 


Mj .M. ; 
U4? 


M 
= g,, (Pii~ Pi: P +5)” 


Wy 
Pi-P +5 


— [pray fils Naw) fa Wy 
= |[aedy A@OAY) 


and remains finite when we pass to the limit. My coefficient of mean square contingency, C2, is 
given by 
¢* 
ey eee AEN 
“NM 14+¢?? 


j . 1 P ica he 
and this always lies between 0 and J a where 2 is the number of cells occupied if there 
D 


in the limit, 


be only one occupied cell in each row and in each column of the table. Only when ~ is indefinitely 
large, does C, range from 0 to unity, for example in cases of continuous distribution. So far from 
considering this a disadvantage of C,, I hold it an essential property of a contingency coefficient ; 
for as long as the cells are not indefinitely small, it is possible to arrange the individuals in them 
so as to increase the contingency. J cannot therefore hold that in choosing y? in the form (i) 
instead of C,, we should gain any advantage from the fact that its upper limit is unity. I do 
not think a good coefficient of contingency ought to be unity until we introduce the conception 
of continuity, namely every y having attached to it a definite x, so that y is a continuous function 
of x. The fact that Professor Steffensen’s y? is necessarily always zero when we pass to con- 
tinuity, suggests that it will take a very low value in cases where we should expect a high value 
of the variate relationship. Indeéd, the presence of the factor m;; in the numerator of (i), or of 
pi; in Professor Steffensen’s expression, means there can be no contribution to y? from the zjth 
cell when m;., m.j;, or p;., p.; are finite but p;; is zero. This seems a serious disadvantage in 
practical working, for if m;. and m.; are considerable, and yet m;; zero, we should expect a con- 
siderable addition to the contingency, which is not provided by the Steffensen w*. 








| 
| 
| 
| 
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Illustrations. 


(1) Consider the contingency Table I. y* based on the 28 occupied cells is equal to “12833. 
¢* based on the 100 cells which might be occupied with these marginal probabilities is equal 


to 2°983,333 and the Coefficient of Mean Square Contingency C,.= ef $ 








2 
we 
TABLE I. 
-123 3 
-12°2 5 
“sy * 4 
te ie 4 
oe i ae Na ° 4 
-121> 4 
-121-> 4 
121-> 4 
221. . 5 
21- 3 
4444444444) 40 








Surely a high degree of relationship might be anticipated from such a table; but yw? gives it 
a comparatively low place. 


(2) Now let us introduce more rows of 1, 2, 1 so that instead of 10 fours in the horizontal 
marginal total we have 15, Table IT shows the distribution. Here y?=-138,003 and ¢?= 4°858,333, 
giving a Coefficient of Mean Square Contingency C,=V (1+@*)="91066. Thus both y? and C, 
have risen, but y? remains low and Cy is very high. The latter is in accordance with what we 
should expect, if this were a correlation table for continuous variates. 


TABLE Il. 








| - $er 3 

| - -129229] 5 

e -121-+] 4 

& -121- 1 

x -121- 4 

| - 121: 4 

| - 121: 4 

| we (5: oe ee 4 
| 121+ 4 
-121-- 4 

-+121 4 

-121> 4 

-121 4 

221: 5 

21> 3 

444444444444444] 60 








(3) We will now test what happens if we increase the contents of the central cell in the rows. 
Table III gives such a distribution. Here Stetfensen’s y? rises with a leap to °500,940 while my 
Coefficient of Mean Square Contingency C,=-9374. This result is not very informative. It 
indicates that y* is very sensitive to increase in the central probabilities, which we pushed up 
from ,; to ;};, and to decrease in the other probabilities which we have reduced from gy to y}p- 
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TABLE III. 


_— Oem 
o 8 
_ 
i 


Su Lace es l 


101010101010101010101010 | 120 











But it does not in itself render obvious whether a rise from ‘1380 to ‘5009 is a more reasonable 
change than one from ‘9107 to ‘9374. 


We will consider these changes on another series of tables. 


(4) Consider Table IV. Here @ priori I should anticipate a very considerable degree of 
relationship. Steffensen’s y~?=-20139. The Mean Square Contingency, ¢*, =2 and C,=°81650. 
Thus the y? is small and the C, very considerable. 


TABLE IV. 


tt et et et 





(5) We will now increase the number of rows of ‘5, ‘5 until we have 99 of them with a ‘5 as 
the first and last row. Accordingly there will be 101 rows and 100 columns. Again I should 
anticipate that such a table must make a high degree of relationship. wy? =*24750, while C,= 98094. 
My opinion is that wy? gives a far too low measure of the degree of relationship of the two 
variates, and its values are not well distributed throughout the range 0 to 1. 


TABLE V. 


) ) 
5 5 | 
el te 5 as | 

‘5, ‘5 in 99 rows 
altogether . 
+ eS SS 1 
5 5 l 
H - 9) 











1 OME, 








OES, 
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(6) Let us now see what happens when we increase the scatter. We will take four “5’s to a 
row. Table VI suggests a high degree of relationship, if it were based on a continuous variation. 





— 


TABLE VI. 
° § 5 
- 55] 1 
- 5 5511S 
- 5 5 5 5 2 
* 6S O. 2 
- 5 5 5 5 + 2 
-56556 5 - 2 
56°55 5 - 2 
- 555 5 2 
5 GS * 2 
Bi 5 
: 1 
5 











Steffensen’s y* sinks to 04770, while my ¢?=2°25, giving as Coefficient of Mean Square 
Contingency C.=°83205. We shall find that this drop down of the w? as we get lower and lower 
probabilities is the influence of the ;; term in the numerator, and that, if we reach continuous 


variates, the influence of the approach of dxdy to dxdy and zero would be more and more 
conspicuous. 


(7) Table VII shows a still greater increase in the scatter. We should anticipate lower 
values of w? and C., but still there is a very considerable degree of relationship between the two 


variates. We find y?='01916 and ¢?=1°5, giving a value °77460 for C4. 


TABLE VII. 








- "5 5 

6 5 1 

° 6 56 5 115 

- 6 5 5 5 2 

5 5 5 4) 5 2°5 

663639 OS 3 

- § 5 6 5 % 5 3 

§§ 5 5 8S DG 3 

o -& §. 6.5 3G - 3 
56S. GD = 3 
§ 6 5 5 5 2°5 
5 5 5 5 2 
G HB - Ld 
. eB 1 
D i) 
333333 3 3 3 3] 30 








It is clear that the two methods differ completely in the positions in their respective scales, 
both running from 0 to 1, which are taken by the same table. It certainly seems as if the 
distributions dealt with in Tables I to VII ought to show high relationships between the variates, 
but actually judged by y* they give in all cases, but Table III, remarkably low values. We 


17—2 
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attribute this in the first place to the presence of the factor p,; in the numerator, which as we 
have shown at the beginning of this note tends to make y? vanish when the number of 
probabilities is large and their individual values are small*, except in the case of perfect 
contingency, when the value of y? seems suddenly to spring to unity. I do not therefore think 
that the values of y? can be very satisfactorily distributed through the range 0 to 1, since tables 
which under ordinary circumstances would be assumed to mark a high degree of relationship are 
nearly all crowded into the first quarter of the scale. Beside the advantage of ¢? in this respect, 
and its approach to a finite limit when we pass to continuous variates; the coefficient of mean 
square contingency for an increasing number of cells passes into the correlation coefficient, on the 
hypothesis of normal distribution, and may even before that stage be fairly satisfactorily corrected 
for grouping. 


* One other case was worked out with an extremely small scatter in the arrays, but with 2000 
values p;; differing from zero. Its form is indicated in Table VIII in the form of probabilities. The 
reader has to imagine this a contingency table of 1,001,000 cells with a narrow diagonal strip of only 


TABLE VIII. 








° - oat oe 
4 200 2 x 0 
* 2000 2000 | 1000 
° ° , Tas a Ps ug 
a 2000 2000 1000 
999 such rows . : : 
. e 1 
of selon» 2000 
. = ° e ° . . . ° _ 
2000 2000 1000 
oe 505 e ° ° . . ° ° . | 
2000 2000 1000 
zoloe . ° ° . e ° . ° . >oao 
1 e ° . + Fa rs 
1008 1000 100s TOOT 1000 1000 1 








2000 cells occupied, and think of this in terms of linear correlation! The correlation would be almost 
perfect, Steffensen’s y* for this table is -24975, and C,=*99900, an almost perfect relationship. 


(iii) Spearman’s General Factor Again. 
3y BURTON H. CAMP. 


In a recent papert Piaggio has considered again questions having to do with the uniqueness 
of Spearman’s general factor g, and has held that this uniqueness does not follow from the 
usual hypotheses and has produced further illustrations of cases where “almost uniqueness,” 
as I called it in my Biometrika paper}, does exist. Spearman’s reply, adjoined to Piaggio’s, 
apparently admits Piaggio’s mathematics, and parenthetically mine, but asserts that our 
objections are not wholly relevant. It is tedious to drag out a controversy after the scientific 
evidence is once agreed on and published where everyone may examine it for himself, but 
I cannot but ask a brief space in order to point out as tersely as possible and without adjacent 
forbidding-looking mathematics two quite definite errors which it seems to me Spearman has 
made in his reply. 


First, he says that the sind of indeterminateness that g has is not of the same sort as «, y, 
and z have in the equation of the plane, 


Ax+ By+Cz=0. 


+ British Journal of Psychology, Vol. xxtv. July 1933, pp. 88—105. 
{ Vol. xxiv. November 1932, pp. 418—427. 
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I think, on the contrary, that both Piaggio (p. 96, last line) and I (p. 424, Section 5) have shown 
explicitly that the indeterminaieness of g is of exactly this type. For we have shown that the 
indeterminate factor (£i in Piaggio’s notation, ¢ in mine) cannot be determinate because it need 
satisfy only K+1 equations and it is itself a set of VY unknowns, and J is greater than X+1. 
Similarly, Spearman’s x, y, z cannot be determinate because they need to satisfy only one 
equation and they are themselves a set of 3 unknowns, and 3 is greater than 1. It seems to me 
impossible to recognise a difference in type between these two cases. It might be asserted that 
if V, the number of individuals tested, is small, the range of indeterminateness is limited by 
the numerical possibilities, because the standard deviation of di is fixed, but these limits are 
exceedingly broad, and they increase indefinitely as V is increased*. Therefore such a claim 
could have no meaning from the psychological standpoint, for it cannot be true that the question 
whether individual Smith possesses a certain mental quality depends in some mathematical or 
occult way on how many Joneses are dragged in to be tested with him. 


Secondly, Spearman says that the arbitrary term (42) in the formula for g represents the 
“jnexactitude” of one’s determination of g, not a multiple nature in g itself. The error that he 
appears to me to be making here cannot be described so simply, but, I think, it is very im- 
portant, especially as it is also implicit in the remarks of several other authors. It has been 
shown that, for the given set of X—1 tests and W individuals, there exists not one set of Vg’s 
but an arbitrarily large group of such sets. Spearman appears to think that, because the 
variability of g within each of these sets is limited, it fotlows that the variability of a g from set 
to set for each fixed individual is also limited. This is not true, except in those extreme cases 
when o,;=0, exactly. Unfortunately Piaggio’s language also encourages one to fall into this 
error, for in discussing his illustrative examples he implies that g can be shown to be practically 
unique if o,; for each set is negligibly small. 


But it is easy to select a group of sets of 4’s in which the variability within each set is as 
small as he indicates but for which the variability from set to set for any individual chosen 
arbitrarily in advance is very, very large. It is clearly, however, this last variability which 
must be proved to be small if it is to be shown in this manner that each individual has a 
characteristic g. 


The gist of the matter is really contained in the mathematician’s notion of “order of choice.” 
If, before looking at Smith’s scores on the tests, one may choose a number at random (subject 
only to the broad limitatious mentioned before), and can then demonstrate that this number 
can be assigned as Smith’s g, as well as any other number, and in perfect harmony with all 
the other hypotheses, then it is meaningless to assert that Smith has a g. In the numerical 
illustration at the close of my Biometrika paper, two such Smiths were exhibited (individuals 
11 and 12). 


It is not, of course, contended that a unique general factor does not, in truth, exist, but that 
its existence does not follow from Spearman’s hypotheses. Moreover, it would seem inherently 
impossible that it could follow from his hypotheses, for into them he has introduced only group 
averages, of means, o’s, and 7's. Out of them, therefore, whatever the mathematics, one could 
expect to obtain nothing more significant than characteristics of the group as a whole, and 
indeed this much, to repeat a statement in my other paper, has actually been established in the 
“almost unique” cases. 

* E.g., it is numerically impossible to choose 8 numbers for which the total range of variation is 
greater than 4c, or 50 numbers for which it is greater than 10c. 
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(iv) Note on the Recurrence Formulae for the Moments 
of the Point Binomial. 


By A. A. KRISHNASWAMI AYYANGAR, Mysore University, Mysore, S. India. 


§ 1. It is the object of this note to indicate that the recurrence formulae for the moments of the 
point binomial (p+gq)" about the mean zg can be most simply obtained by a little ingenious 


transformation of the term (¢—nq)* p"~*g! (7) in the series for », the sth moment. This trans- 


formation explains why the recurrence formulae for incomplete moments are as easy to derive 
as those for complete moments, which are particular cases of the former. 


We may write 


(¢—ng)* p"~"g “(; Pang (¢= 1-1. gt py—t pr tg (411) ng (¢—ngye-t preg! (*) 


[since p+q=1] ...... (1). 
t=n 
Let vP= Kod (t¢—2q)* p"— *gt (; ). 
Then 
] n=—1 t=n n 
a 3 y P= et Th ate n=. 8--layn—tpt-1 = — na t—lanr-tat 
ag a 2 l-n qt+p) | p"-'¢ ely Rade ng)*— 1 p"—*@ (*) 
U=n-1 : 4 ee t=n n 
=m (t'—n- 1.g+p)*—'pr-* -1¢8 ( ‘ )- > (tng) pe 'g' i, 
t’=p-1 é t=p t, 
i.e. phan {(B+ pyri ye mel} 0g cscsosesscscecetecsasscesersessncees (2), 


where E(v'?')= v a and ye 1 is the sth incomplete moment defined for the binomial 
(p+q)"—'. Putting p=0, we get the recurrence formula for complete moments 
Peg = 900 fan RG CA) pay a iccs odactesccccvessecsecnsseess (3). 


The formulae (2) and (3) are believed to be new. 


Application: 
For brevity we write v, and »,” for vf? and »,?-1. Putting s=1 in (2), we have 
" ; ‘ ” v1 
v1=(v¥9 —V) Ng, 1.€. vo’ = arr sued easus bvesad sevassawhensuseyen (4). 

But 

t=n—-1 n—)} 

vy" = a > ?- 1 ~tgt ( , ) 
t=p-1 


= m—-1-tot+1( * n—t 7) n-i+1 api Fey) »(*) n-1 (2-1 
hanes 2 [Pp (31) +? ¢(" t f T  MAOe ES et. heck 
tnat (°*7’)- n—-) ( n ) sme | 
' ie 1 )P+( ra) 9 t+1)% 
v=n t=n- 
_— -] n—1 
= 3 n—-t ’) > ‘[e n-t (n— ‘)- n-t+1,t+1 n na*-1 
rope # ( of Sa eC , P On \e41) [Pl Ng 1) 


i.e. 19 =v +p" Pt+1¢p-1 ia pobch onacapeisegiansuassacvasane ties (5). 
‘ me 
Comparing (4) and (5), we get 
vy =np"—P*1¢p CS =ppr-erige(™ i (6) 
p- i 7 ap | Seeneeeerrnies art eer re 5 


* This result, for which we have given here a simple proof, is due to Ragnar Frisch. See Scandi- 
navisk Actuarietidskrift (1924), No. 3, p. 161, quoted in Biometrika, Vol. xvit. (1925) p. 170. 








eto | 
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Hence ne Veal oh deca) pos! eRe ee eee oe So (7). 
Similarly by setting s=2, 3, ... in (2) we calculate in succession 
V2 =v; (p—n+l1. q) + vonpg* J 
(p-1) 
ng 
vy =, {((p—ntl. 7 +pq (2n —1)} + vonpg (p -9)* 


v2" =r {p- P+ -p+1} +rypq(n=1) 


” ee) ee ee NN ae ee 8). 
va" = 21 {(p-1)[(p—1 —ng)* + pq. 2-3] + pg (P—9) (—1)} (8) 


bony ree +9 (n—1) pg (p-g) 
vy =r [(p—n+1.9)5+3pq (n—1) (p—n+1.9)+p9(p—g) (Bnr—1)] 


+ vo [npg (1+3pq.2-2)] 
and so on 





/ 


In the above expressions, the coefficients of v, are the complete moment coefficients and follow 
the recurrence-relation (3); and it must be possible to discover a recurrence relation for the 
coefficients of v; also (see the following section). 


§ 2. We will now give a simple proof of Pearson’s recurrence formula t, which enables us 
also to write down immediately the corresponding recurrence formula for incomplete moments 
given by Ragnar Frischt. 


We transform (¢—2q)*~! p"~*¢! in two different ways : 


g 
1) pt (t—nq)s-! p»—tgt (¢ —nq)* p™—tgt ”\ +-ngp (t— ng)*-1 p-tgt (" . 
r OT ee aE Ae dP Ellas ae 22. 


(2) pt (t—ng)*—1 p— tat Ghee 1 oem a a 1(." )@-t40¢ 


=(t—agy* eo itig (," 1) (mae -q.t-1=ng) 
since p+q=1. 
*. p{(t—ng)*+ng (t—2ng)?-}} p"-*g (F)=(-ngytpe- i=Tgi-1 i”) (ngp—q.t—1—nq) 
a : (9). 
“ps i Saaiabienion 


ie ~ *{ ee ve ee \ — -- 
(t’—ng+1)8—! p»-" gt (7) (ngp —q.t' —ngq) 

ee i | t'/) 

v=n 


= = a —ng+1)8—! prt g" ()} (ngp—q.t'—2q) 
v=p 


+(p—2ng)*-! pre +1ge-1 (1) (ngp-—q-p—1—n79) 


v=n ——w 
= > {(e—ng-tiymt peg! (7) (ngp —q.t' — 2g) + (p— ng)?! pp™—P*1 9g? (*), 
U=p 
ie, p(B + ng E8—?) vp=(£41)8-! (ngpyo— gui) +1 (p — 2g), 
i.e, {(1+ £)8—!— B8-}} (ngpvg — qi) =Ve— vy (DH NG)PW! cecceececcecsecceees (10). 


If we put here p=0, then v;=0 and we get Pearson’s formula. 


* These results were given previously by Ragnar Frisch. 
} Biometrika (1924), Vol. xvi. p. 160. 
t Ibid. (1925), Vol. xvi. p. 171. 
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From (8) we see that », is of the form v,A,+vous, Where Ay»=O and Ay=1. Using this 
expression in (10) and equating the coefficients of »;, we get the recurrence relation for \,, viz. 


{(1 + E£)*-! — E*-}} (ngpro — gry) =Ag— (9 —NG)PW  vaceeesseeeseeeeneees (11), 


which was not obvious from the other formula (2), 


(v) A Note on the Incomplete Moments of the Hypergeometrical Series. 
By A. A. KRISHNASWAMI AYYANGAR, Maharajah’s College, Mysore, South India. 
1. The aim of this note is to indicate an extremely simple and elementary method, applicable 


alike to complete and to incomplete moments, of deriving the recurrence formula* due to 
Karl Pearson for the moments of the hypergeometrical series defined by 





Pe ipe snipe PT) ft) 5 tee gige=3) 
n(n—1)...(n—7 +1) 1’ pn—-r+1 1.2 °(pn—r+l)(pn-rt+2)> 
Here the coefficient of w', which we may denote by ¢(¢) for brevity, is the probability of drawing’t 
individuals without some mark in a sample of 7 individuals selected at random from a population x 
containing pn marked and gn unmarked individuals. The mean of the samples is evidently rq. 


The incomplete moment of the sth order about the mean is defined by 


= 
ve= = (t—1rgq)* p(t). 
t=p 


2. The following identities are self-evident : 


(i) t (pn—r+t)=n (t-rq)+(r—t) (qn—d); 
(ii) (r=t) (qn—t)=(t-7q)*— 4 (t- 79) +e, 
where q=ngtr(p—g) and ce=rpg(r-7r); 
(iii) t (pn—r+t) b(t)=(r—-t=1) (qr—-t-1) p (t=). 


Multiplying both sides of (iii) by (¢—7qg)*-! and summing for values of ¢ from p to r, we have 
using (i) and (ii), 


op (t) (¢—1q)*"! {n (t—1rg) + (t— 1g? — 1 (¢-— 7g) + eg} 
p 


iMwe 


t 
r — — 
= 3 d(t—1)(¢-rgq)**! (r—t-—1)(qn-t—-1) 
t=p 


1 


= = 


= p(t’) (t’—rq+1)*-1(r-¢) (qn—-¢); where ’=t-1, 
Ut=p-—1 


- 
= = d(t’)(-—rq+1)*~ 1 {(t —rgq)?-—¢ (t —1rgq) +9} 
t=p 


+¢ (p—1). (p—7rg)*"! (r—p+1) (qn—-p+1). 
Introducing the operator # to denote the operation 


E' (v,)= 5415 
we write the above result in the form 


ny, + B®} (vy — 171 + CQv9) = (1+ £)8-! (ve — 1 71 +0219) +(p —7q)*"! p (prn—7r+p) > (p), 
{((1+ £)8~! = B*~}} (vg —¢, 91 +09) =n, —(p —17q)*—! p (pn—r +p) > (p). 


* Biometrika, Vol. xv1. (1924) p. 159. 
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Put s=1, nvy=p (pr—rt+p) >(p), 


i.e. ” es $b siscccccoondane (1). 


Hence, we may write the recurrence formula for incomplete moments in the form 
{11 + £)8—! — £8-}} (vg—cy 91+ Cgvy) = nv,— 2 (p—7g)>- 1H ..... 000 Rroeeree: (5 
This shows that we may put 
Ve=PsVoFAsV}, 
where Ag=M=0, Ay=po=1F, 


in (2) and equate the coefficients of »; and v9 and obtain recurrence formulae separately for 
A, and ps. 


In fact, the recurrence formula thus obtained for ps is the same as Pearson’s for complete 
moments, while that for A, is 


{(1+ £)8—! — 8-3} (Ag — cy Ay + Cody) = Ag—(P—7G)P~ NE cecseeeceeeeseeees (3). 
Put 8=2, Ay—cy=NAg—2(p—7q), 
ie. N= Merg.r+t—ripa® nibihes bedahe dda eee (4). 


Put s=3, (1422) (Ag—cydy+c2Xo) =2As + (1 — My) Ag + (Bey —c,) = MA — 2 (p—7g)*. 


fa-ra)—c 
Therefore (wm — 2) As=2 (p—rg)?+(1 — 2c) es _ ~ “14-269 —Cy, 
1 = 
SO en es 2h(n—y 2 oy — ’ 
As (n—1) (@—3) [n? {(p —r+19)?+pq (27 -1)} 
—n{(p—r+1.9)*+(p—rt+1. 9) (2r-1) (p-gq) 
+ pg (2r2— 2r — 1) + 7r}4+ Br? (1 — Bpq)] «.......seeeeeceeseseceeeees (5). 


When p=0, it follows from (1) that »,=0 and obviously »»>=1, so that v,=p,, which satisfies 
Pearson’s recurrence formula. 


(vi) On Simometers and their Handling. 
By K. PEARSON. 


The original purpose of the simometer was to measure the flatness of the nasal bridge by the 
ratio of its subtense to the chord. For this purpose we may take either the simotic or the rhinal 
index, or indeed both. But for the purposes of this note we are not concerned with what terminals 
may be chosen for our fundamental chord ; we are concerned only with what is the definition of 
our “subtense.” Probably to most mathematicians the subtense is the portion of the perpendicular 
bisector of the chord intercepted between the chord and the are. It was from this conception of 
a subtense that Newton proceeded to his definition of curvature. If the are while still an arc of 
continuous curvature be not symmetrical about the chordal bisector but skew to it, then the line 
from the point of chordal bisection to the point of contact of the tangent parallel to the chord 
has been considered—as in the cases of parabola and ellipse—as the subtense. 


Now in the case of the nasal bridge when the two sides of the nose are not mirror images the 
one of the other, the length measured as “subtense,” either by the simometer of Mérejkowsky 
or by the co-ordinate calipers of Woo (see pp. 197—98 of this issue), is not the mathematical sub- 
tense as just defined. What, however, is of more importance, the two instruments do not, or 
should not, give the same result, and even the same instrument will give different results according 


* This is a generalisation of Frisch’s result given in Biometrika, Vol. xvit. (1925) p. 170 (18). 
+ These values have to be used only after the operator Z has had its effect, 
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to the manner in which its use is prescribed. Both Mérejkowsky’s and Woo’s simometers possess 
two degrees of freedom for the measuring tip of the mid-arm. We will consider their geometrical 
aspects separately. 


Let ab be the chord and afdb the arc of which the subtense is to be measured. Then when 
the tips of the curved outer arms of Mérejkowsky’s simometer are placed on a and 6 there is 
a fixed point g of the instrument at equal distances from aand 6. Through g a line passes (actually 
a rod), the end of which, d (the tip of the rod or mid-leg), can be adjusted to any point of the area 
agb by a turn round g and a forward motion of the tip d produced by a screw reading to jy mm. 
If gd be taken to bisect the angle agb, then Mérejkowsky’s will enable us to read off the subtense 
cd, that is the length of the line through the bisection of ab, perpendicular to ab, and reaching 
to the curve. This requires two readings of the instrument, first to measure gd directly, and 
second to measure gc; the latter can only be obtained by retaining the outer legs at the distance 
ab, removing the simometer from the nasal bridge and, without rotating gd, briaging d down toc 
so that the lips of the three tips of the legs all touch a plane. 


If the tip d be not kept to the perpendicular bisector cg, but directed to some point / on the 
arc, ten Mérejkowsky’s simometer measures /7, which has no pretence to be a subtense, and may 
be greater than the maximum distance fe of the arc from the chord. The true maximum subtense 
will only be given by this instrument when the nose is symmetrical and care is taken that the 
mid-leg bisects the angle agb. This simometer provides no means of ascertaining directly the 
point f of the arc most remote from ab, and if the tip of the mid-leg be brought to f the subtense fj 


a Jj Cc 
\ 















Fig. 1. ag=gb, cg is a perpendicular bisector of ab, f is highest point of are afb, 
and kh is an arbitrary point between f and d. 


might in a very skew nasal bridge be such that 7 did not even fall inside ab! Further, we have 
seen that f cannot be ascertained from this instrument. What the simometer can achieve is the 
ascertainment of the minimum distance from ab of the trace of the nasal bridge on the plane 
perpendicular to and bisecting the chord ab. 


We now turn to Dr Woo’s simometer or his co-ordinate calipers*. 


It has three legs perpendicular to a bar, of which the second (gf) and third (mb) can be shifted 
along the bar and the second or mid-leg moved also perpendicular to the bar : all three legs and the 
bar may be discussed geometrically as coplanar lines. Fixing the external Ja and mb with their 


* This name does not seem very well chosen. 
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tips at a and b, the mid-leg, gz/, has two degrees of freedom, and there is no difficulty in bringing 
it into the position f which enables the maximum subtense of the chord ab to be read off. By 
turning the instrument round a and reading off a number of maximum subtenses like ef, we can 
find their minimum, which is probably the quantity of most value to the anthropologist. But - 
unfortunately Dr Woo does not appear to have used his simometer in this manner, any more than 
the later workers in the Biometric Laboratory have used Mérejkowsky’s simometer with the 
mid-leg bisecting the angle between the chords of the external legs. The procedure in both cases 
has been apparently to draw on the nasal bridge a line supposed to represent the nasal bridge, 
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Fig. 2. ¢c is mid-point of ab, al=bm, f is highest point of asymmetrical are afb, 
efng is perpendicular to ab, f’ is an arbitrary point, f’e’ is perpendicular to 
ab, g is a point on efn produced, and Im is produced beyond m to some 
distance. 


and this seems to have been done without any reference to the chord ab. Its obvious purpose 
is to get rid of one of the two degrees of freedom, which both irstruments possess when working 
in a definite plane. But if we pay no regard to the chord ab, f may well be at 7” instead of the 
point at maximum distance from ab, and accordingly Dr Woo’s simometer would measure /’ e’, 
not fe. Even with a nasal bridge which without regard to sutures had perfect mirror symmetry, 
both simometers would fail to give the maximum subtense if @ and 6 were not images one of the 
other in the mirror plane of that nasal bridge, and we aimed our mid-leg tip at the point where 
the arc met the mirror plane. 

To sum up: Both instruments seem to have been used in a manner which, at any rate 
theoretically, invalidates their purpose, if that purpose be to find the minimum subtense (per- 
pendicular to ab) of all the maximum subtenses of sections of the nasal bridge by planes 
through ab*. 


Woo’s instrument could give the required result, but with more labour than is necessary 
to obtain a doubtful result from an arbitrary median nasal ridge. Mérejkowsky’s simometer 
could never give in asymmetrical nasal bridges (including the asymmetry of @ and 6 as a part of 
the nasal) the minimum of the maximum subtenses, but if we start by defining the median nasal 
ridge to be the trace on the nasal bridge of the plane perpendicularly bisecting ab, it will give the 
minimum subtense in this plane by simply causing the mid-leg to bisect the angle at g. No 
system which does not define the median nasal ridge with reference to ab can it seems to me 
be really satisfactory. 


* Optically we may suppose the eye placed at a great or infinite distance along the line ab produced. 
Such an eye would see the nasal bridge as a plane curve (although all the points of the curve might not 
be coplanar), and the point on this curve nearest to the line ab would give the required subtense. 
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The optical conception could be carried out excellently with the cranial co-ordinatograph *. 
Place the skull by aid of it with ab perpendicular to the drawing board, and trace with the knife 
edge projector the contour of the nasal bridge ; the minimum distance from the plan of the line ab 
to this contour would be the subtense sought. This frees us from any arbitrary choice of a median 
ridge for the nasal bridge or from the supposition that the maximum subtenses of all the 
sections through the chord lie in one plane, the trace of which can be determined by personal 
appreciation. 

I am inclined to believe that the simometer of Mérejkowsky, with its choice of the plans of 
bisection of the chord as plane for measuring the subtense, is at least as accurate as Dr Woo’s 
calipers applied to an arbitrary median ridge which pays no attention te the chord. What the 
latter method would achieve if used as a three-degree of freedom instrument to determine the 
minimum of the maximum subtenses it is difficult to say without experimenting. What 
theoretically seems to me erroneous is the determination of a nasal ridge without direct reference 
to the chord ab. That the two instruments do not lead to widely, although they do to signifi- 
cantly different results is consoling to the craniologist, if not to the theorist. There are certainly 
some erratic differences in the tables on p. 200, which require clearing up. 


It may not be out of place to recall here a method I introduced in 1911 for determining the 
flatness of the nasal bridge. A fine celluloid tape or even a thread may be used to measure 
the shortest arcual or geodesic distance from the terminals of the chord ab across the nasal 
bridge. A steel tape is unsuitable. The chord ab is also measured. A very suitable measure for 
the flatness of the nose is then 

8=100 (arc — chord) /are. 
This is adequate without any determination of the subtense. 


But, if we desire to find the subtense, it is needful to choose a curve sufficiently elastic to 
represent the nasal bridge. The circle was no use, the parabola somewhat better, but finally, after 
drawing a good many nasal bridge sections, I concluded that the catenary gave the best result. 
If then 

a= 100 subtense/chord, 


it is needful to compute a for each value of 8. Tables to find a given 8 were calculated by H. E. 
Soper and Dr Julia Bell t. 


It must be admitted at once that like Mérejkowsky’s simometer this method starts with the 
assumption of a symmetrical nasal bridge. But if the nasal bridge be skew, the increased length 
of arc will to some extent compensate in an increased subtense, which compensation is lacking in 
the case of the simometer. 

It would, I think, be of actual practical value, especially as the racial value of the simotic 
index is becoming more generally acknowledged, to take a series of, say, one hundred skulls, and 
measure the chord ad, the arc afb and the minimum of the maximum subtenses, using 

(a) The cranial co-ordinatograph to give “the true value” ; 

(b) Mérejkowsky’s simometer with the mid-leg as bisector ; 

(c) Woo’s simometer, with no attempt at personal appreciation of a median nasal ridge, and 


(ad) The catenary method. 


Such an experiment would enable us, with due regard to accuracy of result, to the time and 
to the labour involved, to select the practically best method and to discard the others. 


* See Biometrika, Vol. xxv. pp. 217—253, 

+ Published originally in Biometrika, Vol. vu. pp. 316, 318 and Vol. rx. pp. 401—2; reissued 
Tables for Statisticians and Biometricians, Part 1. pp. lvi—lvii, $2—64. 

+ It would not be difficult to devise a mechanically less crude instrument of the Mérejkowsky type 
than the present one. 








